TITLE OF THE INVENTION
POLYMORPHIC DNA MARKERS IN BOVIDAE 

CROSS-REFERENCE TO RELATED APPLICATIONS

The present invention is a continuation-in-part of
U.S. Ser. No. 642,342, filed January 15, 1991,
incorporated herein by reference.

FIELD OF THE INVENTION

The invention relates to gene mapping, selective 
breeding and genetic identification in domestic animals. 

BACKGROUND OF THE INVENTION

The publications and other materials used herein to
illuminate the background of the invention, and in
particular cases to provide additional details respecting
the practice, are incorporated by reference and for
convenience are numerically referenced in the following
text and respectively grouped in the appended bibliography.

Until recently, artificial selection has relied on
the biometrical evaluation of individual breeding values
from an animal's own performance and from the performance
of its relatives (136). This biometrical strategy is based
on relatively simple genetic premises, operating within
a "black box". Briefly, the majority of economically
important traits are so-called complex or quantitative
traits, meaning that the phenotype of an animal is
determined by both environment and a large number of
genes with individually small, additive effects. The
proportion of the phenotypic variation observed in a
given population that is genetic in nature is the
heritability of the trait. Substantial genetic progress
has been obtained using this approach. One of the
strengths of this biometrical approach is that it obviates
the need for any detailed molecular knowledge of the
underlying genes or Economic Trait Loci.

However, it is believed that the molecular
identification of these Economic Trait Loci could
increase genetic response by improving both the speed and
the accuracy of selection, through a procedure called
Marker Assisted Selection (91, 96). One strategy towards
the isolation of Economic Trait Loci relies on the use of
DNA Sequence Polymorphisms as genetic markers in linkage
studies. This approach, paradoxically referred to as
"Reverse Genetics" (138), will be described in detail in
this introduction. Moreover, we propose a new concept
called "Velogenetics", or the combined use of Marker
Assisted Introgression and germ-line manipulations to
shorten the generation interval of domestic species
(especially cattle), which will allow the rapid and
efficient introgression of mapped Economic Trait Loci
between genetic backgrounds.

I. DNA SEQUENCE POLYMORPHISM (DSP)

A. Types of DNA Sequence Polymorphism

The typical mammalian genome is composed of a DNA
stretch approximately 3 x 10^9 base pairs long, divided
over a species-specific number of chromosomes, and
containing all the information required for the proper
development and functioning of a normal being. Each
individual has two copies of this message: one paternal
in origin and one maternal. Although their overall
architecture and content are virtually identical, the
paternal and maternal DNA sequences exhibit subtle
"allelic" differences, hereinafter referred to as DNA
Sequence Polymorphisms or "DSP". The DSP that can be
recognized in a given population are the molecular basis
of the genetic component of the observed phenotypic
variance. One can distinguish three types of DSP.

1. Single Base Pair Polymorphisms
As their name implies, these DSP are due to single
base pair differences distinguishing alleles. These can
be either base pair substitutions, namely transitions
(purine to purine or pyrimidine to pyrimidine) and
transversions (purine to pyrimidine and vice versa), or
the insertion/deletion of a single base pair.
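
The distinction between transitions and transversions can be sketched in a few lines of Python. This is only an illustration; the function name classify_substitution is hypothetical and does not come from the original text.

```python
PURINES = {"A", "G"}
PYRIMIDINES = {"C", "T"}

def classify_substitution(ref: str, alt: str) -> str:
    """Classify a single base pair substitution as transition or transversion."""
    ref, alt = ref.upper(), alt.upper()
    valid = PURINES | PYRIMIDINES
    if ref not in valid or alt not in valid or ref == alt:
        raise ValueError("expected two different bases among A, C, G, T")
    # A transition stays within one chemical class; a transversion crosses classes.
    same_class = {ref, alt} <= PURINES or {ref, alt} <= PYRIMIDINES
    return "transition" if same_class else "transversion"
```

For instance, a C to T change is a transition, while an A to T change is a transversion.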

The frequency of single base pair polymorphism is
measured by the nucleotide diversity, or average
heterozygosity per nucleotide site (1). The nucleotide
diversity has been estimated from Restriction Fragment
Length Polymorphisms at 0.002 for human (2), and at
0.0007 in cattle (3, 4). This means that on average
a human will be heterozygous for one in every 500
nucleotides, and a cow for about one in every 1,500
nucleotides.
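
The conversion from nucleotide diversity to the average spacing of heterozygous sites is a simple reciprocal, sketched below. The function name is hypothetical. Note that the reciprocal of 0.0007 is close to 1,430, which the text rounds to 1,500.

```python
def nucleotides_per_heterozygous_site(nucleotide_diversity: float) -> float:
    """Average number of nucleotides per heterozygous site (1 / diversity)."""
    if nucleotide_diversity <= 0:
        raise ValueError("nucleotide diversity must be positive")
    return 1.0 / nucleotide_diversity

human_spacing = nucleotides_per_heterozygous_site(0.002)    # value quoted for human
cattle_spacing = nucleotides_per_heterozygous_site(0.0007)  # value quoted for cattle
```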

One type of single base pair polymorphism deserves
special attention: the CpG to TpG transition. The
cytosine in the CpG dinucleotide sequence is known to be
the substrate of a eukaryotic methylase, which adds
a methyl group at position 5 of the pyrimidine ring if
the cytosine of the complementary CpG dinucleotide is
itself methylated. Deamination of a 5-methylcytosine
generates a thymine, confounding the DNA repair
machinery, which will half of the time resolve the
ensuing mismatch by replacing the original guanine
instead of the mutated thymine. As a consequence,
cytosines in the CpG doublet exhibit mutation rates at
least ten times higher than other nucleotides, and hence
are rich sources of single base pair polymorphisms (4, 5).

2. DNA Sequence Rearrangements

In this kind of DSP, the difference between allelic
variants involves DNA sequence rearrangements such as
the insertion or deletion of a stretch of DNA, DNA
sequence inversions and duplications.

Although there is a wide spectrum of molecular
mechanisms capable of generating such chromosomal
rearrangements, it is well established that mobile genetic
elements significantly contribute to this kind of DSP.

WO 92/13102          PCT/US92/00340

In lower eukaryotes such as Drosophila and yeast,
rearrangements involving transposable elements account
for a large proportion of new mutations detected in
these organisms (6). In the mouse, retrovirus-like
sequences or retrotransposons have been shown to act as
insertional mutagens (7-11), and different strains of
mice exhibit substantial heterogeneity with respect to
the numbers and chromosomal sites of endogenous
proviruses (12). Variation in the distribution of
endogenous retroviruses has been demonstrated in poultry
as well.

In the human, at least 10% of the genome is known
to be composed of retroposon-like sequences. Evidence
for a role of these sequences in human genetic
variability and disease stems from several reports of de
novo mutations due to these sequences: a mutation in the
human Low Density Lipoprotein receptor gene giving rise
to familial hypercholesterolemia is caused by a deletion
brought about by an intrastrand recombination event
between two Alu sequences (13); L1 insertions were found
to inactivate the factor VIII gene in hemophilia A
patients (14); a c-myc rearrangement in a breast
carcinoma was found to be due to insertion of an L1
element (15); an Alu transposition event has been
documented in human lung carcinoma cells (16); and a
homologous recombination between the LTRs of a human
retrovirus-like element was shown to cause a 5 kb
deletion polymorphism. Recently, Wong et al. (17)
reported evidence of human DNA polymorphism arising
through DNA-mediated, rather than RNA-mediated, transfer
between autosomes as well.

3. Expansion-Contraction Type Polymorphism
A significant proportion of the eukaryotic genome
is composed of sequences widely termed "satellite DNA,"
sharing a common organization: a sequence motif, varying
in length between one and several thousand nucleotides,
repeated in a head-to-tail or so-called tandem
arrangement. Depending on the methodology originally used
for their study, i.e. isopycnic centrifugation, pulsed
field gel electrophoresis, agarose gel electrophoresis or
polyacrylamide gel electrophoresis, satellite sequences
were grouped into four size classes: macro-, midi-,
mini- and micro-satellites. Minisatellites are also
known as Variable Number of Tandem Repeats (VNTRs)
(18-21). While macro-satellites seem to be confined to
heterochromatic regions (22), mini- and micro-satellites
have been found scattered throughout the genome with,
however, clustering of mini-satellites (23-34). In the
human, minisatellite clusters seem to be particularly
abundant in preterminal regions (35). The only
midi-satellite described as such to date has been mapped
to the short arm of chromosome 1 (36). In the human, the
polydeoxyadenylate tracts of Alu repetitive elements are
also characterized by length variation and are thus an
abundant source of genetic markers as well (37).

The function, if any, of satellite sequences,
whether macro-, midi-, mini- or micro-, is essentially
unknown. An important feature of all satellite sequences
is that the maintenance of their tandemly repeated
organization is dependent on the concerted evolution of
the repeats. This concerted evolution is thought to
result from successive rounds of unequal crossing-over
(or any other mechanisms fitting the "card deck" model
(38)), which are favored by the tandemly repeated
structure itself. The proposed unequal crossing-over
mechanism, whether happening between sister chromatids
or homologous chromosomes, explains the substantial
degree of length polymorphism, here referred to as
"expansion-contraction polymorphism," characterizing
those sequences. Moreover, the ensuing shuffling of
slightly divergent repeat units or Minisatellite Variant
Repeats (39) within the satellite generates additional
internal site polymorphism. These peculiar properties
of satellite sequences have made them an invaluable
source of highly informative genetic markers, both in
the human and in domestic species (reviewed in 31).

B. Detection of DNA Sequence Polymorphism

During the last ten years, a multitude of methods
have been developed for the detection of DSP. Two
techniques, however, undoubtedly dominate this field:
Southern blot hybridization (40) and the Polymerase
Chain Reaction or PCR (41), used either separately or in
conjunction. A non-exhaustive list is reported here,
the methods being grouped into four classes.

1. Restriction Pattern Analysis 

DSP may alter the restriction patterns of defined
chromosomal regions, generating so-called Restriction
Fragment Length Polymorphisms (RFLP). Depending on the
size range of the explored restriction fragments, one
will use either agarose gel electrophoresis, pulsed-field
gel electrophoresis (30) or polyacrylamide gel
electrophoresis (42) for intermediate, large or small
fragments respectively. RFLPs are classically detected
by Southern blot hybridization. Alternatively, one can
analyze restriction patterns of defined DNA sequences
amplified by PCR, generating so-called Amplified
Sequence Polymorphisms (43). When studying chromosomal
rearrangements or expansion-contraction type
polymorphisms, the use of PCR obviates the need for
restriction enzyme digestion, the DSP being reflected
directly in the size of the amplified product.

Because of its simplicity, the detection of RFLPs
has by far been the most popular approach towards DSP.
The relative lack of power inherent to the method (only
20% of a given sequence is amenable to exploration using
the most common restriction enzymes) can be compensated
for by focusing on highly polymorphic sequences such as
CpG dinucleotides (using enzymes such as TaqI and MspI
containing CpG in their recognition sequence) or
hypervariable minisatellites. The discovery, however, of
microsatellites as a very abundant source of highly
informative DSP in a broad taxonomic range, easily
detectable by PCR, is likely to shift the focus towards
these sequences for future marker development (32-34,
37, 44).

2. Mismatch Analysis

Several methods for the detection of DSP are based
on mismatch analysis. The DNA to be analyzed is
probed with a sequence corresponding to a defined
genetic variant. The presence of a different variant in
the target DNA generates a mismatched heteroduplex,
which can be detected by various means:

a. Detection of Altered Melting Behavior 
A mismatched heteroduplex will differentiate itself
from the perfectly matched homoduplex by an altered
melting behavior, which can be detected either as an
all-or-none, binary response (positive for the
homoduplex, negative for the heteroduplex) or as a more
graded response, allowing one to distinguish between
different heteroduplex variants.

The classical all-or-none test depends on the use
of allele-specific oligonucleotides in hybridization
experiments. By choosing appropriate hybridization and
washing conditions, the allele-specific oligonucleotide
will only recognize a perfectly complementary sequence
(45). With the advent of PCR, new variants of this
approach have been described including reverse dot-blot
(46), the Amplification Refractory Mutation System (47)
or allele-specific polymerase chain reaction (48), and
Competitive Oligonucleotide Priming (49). The Ligation
Amplification Reaction, amplification of specific DNA
sequences using sequential rounds of template-dependent
ligation, can also be considered as a peculiar
application of the allele-specific oligonucleotide
approach (50).

More discriminating is Denaturing Gradient Gel
Electrophoresis, exploring the pattern of melting behavior
characterizing each heteroduplex when electrophoresed
through an increasing gradient of DNA denaturants (51).
The sensitivity of this method can also be improved by
pre-amplifying the target sequence by PCR.

b. Ribonuclease and chemical mismatch detection 
The presence of a mismatch in a heteroduplex makes
those molecules susceptible to cleavage by various means
including chemical treatment with either hydroxylamine
or osmium tetroxide (52), as well as ribonucleases such
as RNase A in the case of an RNA:DNA heteroduplex (51).
Electrophoretic analysis of the cleavage products allows
one to distinguish different genetic variants. Again,
implementing PCR will increase the sensitivity of the
approach.

3. Single Stranded Conformation Polymorphism
Under nondenaturing conditions, single-stranded DNA
has a folded conformation that is stabilized by
intrastrand interactions. Consequently, the conformation,
and therefore the electrophoretic mobility, is dependent
on the sequence. DNA variants indeed exhibit mobility
shifts when electrophoresed in such conditions,
presumably resulting from conformational changes caused
by sequence alterations, hence the name single stranded
conformation polymorphism. Again, the altered mobility
can be detected by blot hybridization analysis or by
relying on PCR (53, 54).

4. Direct Determination of the DNA Sequence
Obviously the most powerful approach towards DSP is
the direct determination of the DNA sequence. The need
for a cloning step, however, in classical sequencing
protocols precluded the analysis of large samples. This
limitation has been circumvented by the development of
genomic sequencing (55), allowing the direct
determination of defined DNA sequences from genomic DNA,
and more recently and less laboriously by the development
of direct sequence determination of PCR amplified
products. The feasibility of the latter approach for the
detection of DSP has been amply demonstrated in several
independent studies (see, for instance, 56).

C. Origin and Evolution of DNA Sequence Polymorphism 

DSP encountered in a given population find their 
origin in mutational events occurring in the germline 
and escaping the DNA repair machinery. The fate of 
these germline mutations in the population is dominated 
by two kinds of effects: stochastic and deterministic 
effects. 

1. Stochastic Effects

When a new mutation appears in the population, its
initial survival depends largely on chance, regardless
of its selective effect. This is easily illustrated as
follows. Assume an individual heterozygous for a
neomutation inherited from its parent, in whose germline
the mutation appeared. If this individual has one, two
or three offspring, the chances for the neomutation to
be lost from the population, because it is transmitted to
none of the offspring, are 0.5, 0.25 and 0.125
respectively. Even if inherited by part of the offspring,
the same "stochastic filter" will operate in the next
generation. In the course of this random drift, the
overwhelming majority of mutant alleles are lost by
chance. However, some will see their frequency increase
in the population and, despite fluctuations over time,
eventually become fixed in the population, until
substituted by the next mutant allele.
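
The loss probabilities quoted above follow from each offspring independently failing to inherit the mutant allele with probability one half. A minimal sketch, with a hypothetical function name:

```python
def loss_probability(n_offspring: int) -> float:
    """Chance that a heterozygous parent passes a new mutation to none of its offspring."""
    if n_offspring < 0:
        raise ValueError("number of offspring cannot be negative")
    # Each offspring independently misses the mutant allele with probability 1/2.
    return 0.5 ** n_offspring

losses = [loss_probability(n) for n in (1, 2, 3)]
```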

As demonstrated essentially by Kimura (57) in the
framework of his neutral theory of molecular evolution,
the probability for a selectively neutral neomutation to
be fixed in a population of N individuals is equal to
its initial frequency 1/(2N), the average time for
fixation is four times the "effective" population size
or 4Ne generations, and the rate k of mutant substitution
per generation is simply equal to the rate of mutation per
gamete and per generation, μ, independent of what the
population size may be.

According to this view, a polymorphism observed in
a population at a given time is composed of "transient"
alleles caught in their stochastic "odyssey" through
the population.

Populations for which 4Neμ << 1 are essentially
monomorphic, while populations for which 4Neμ >> 1 are
characterized by a substantial degree of transient
polymorphism, Ne being the effective population size and
μ the mutation rate per gamete and per generation. The
model predicts a steady state level of heterozygosity, H:

$$H = \frac{4 N_e \mu}{4 N_e \mu + 1}$$
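
The neutral-theory quantities stated above can be evaluated numerically. The sketch below is illustrative only; the function names and the example parameter values are assumptions, not figures from the text.

```python
def fixation_probability(population_size: int) -> float:
    """Fixation probability of a new neutral mutation: its initial frequency 1/(2N)."""
    if population_size <= 0:
        raise ValueError("population size must be positive")
    return 1.0 / (2 * population_size)

def expected_heterozygosity(effective_size: float, mutation_rate: float) -> float:
    """Steady state heterozygosity H = 4*Ne*mu / (4*Ne*mu + 1)."""
    if effective_size <= 0 or mutation_rate < 0:
        raise ValueError("Ne must be positive and mu non-negative")
    x = 4.0 * effective_size * mutation_rate
    return x / (x + 1.0)

# Hypothetical values: a small product 4*Ne*mu gives a nearly monomorphic population.
h_low = expected_heterozygosity(1_000, 1e-8)
h_high = expected_heterozygosity(1_000_000, 1e-4)
```

With these assumed inputs, h_low is close to 0 and h_high is close to 1, matching the two regimes described in the text.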

2. Deterministic Effects

There is evidence that the fate of a significant
proportion of DSP, especially those occurring in
non-coding parts of the genome (composing the large
majority of the genome), is essentially dominated by
random drift. However, when a neomutation affects a DNA
sequence which is expressed at the phenotypic level in
the broad sense, the mutation may no longer be
selectively neutral, and deterministic effects will be
superimposed on the stochastic ones. Negative and
positive selection will respectively decrease or
increase the probability and rate of fixation, while
"balancing selection" will maintain specific alleles in
a population in an equilibrium state.

a. Negative Selection 
When comparing DNA sequences between taxa, it
appears that the estimated number of mutant
substitutions per nucleotide needed to account for the
observed divergence is highest for non-coding sequences,
such as pseudogenes and intronic sequences, and much
lower for coding sequences. For the latter, however, a
distinction must be made between first, second and third
positions of the codons. The third position, for which
only 28% of substitutions are expected to cause an amino
acid change (versus 95% and 100% for first and second
positions respectively), exhibits the highest
substitution rate. When estimating that part of
substitutions at the third positions which are so-called
synonymous, rates very similar to non-coding regions are
observed (57). Moreover, DSP are more prominent for
non-coding sequences and, within coding sequences, at
third codon positions (compared to first and second
positions (58)). These observations are easily explained
by assuming that the fate of neomutations arising in
non-coding regions, or of synonymous neomutations, is
dominated by stochastic effects, while the fate of
mutations causing amino acid replacements will depend as
well on whether or not they disrupt the function of the
protein, in which case they will be eliminated from the
population by negative, "purifying" selection. The higher
the functional constraints imposed on a protein, the
higher the proportion of neomutations expected to be
harmful and, hence, the lower the substitution rate,
expressed at the protein level as a higher "unit
evolutionary time" (average time required for one amino
acid change to appear in a sequence of 100 amino acid
residues).

These observations have been considered as a strong
argument in favor of a predominant role for random drift
in the dynamics of molecular evolution.

b. Positive Selection
According to the previous discussion, the major
drive behind molecular evolution is non-adaptive in
nature, which is in conflict with the classical theory
of adaptive, positive Darwinian selection.

There is, however, evidence for positive and
adaptive evolution at the molecular level in at least a
few instances. Comparing DNA sequences from members of
two gene families, the serine protease inhibitors in rat
(59) and the pregnancy-specific β1 glycoprotein gene
family in man, evidence has been found for higher
substitution rates at first and second codon positions
than at the third position, in at least some protein
regions, pointing towards positive selection.

Moreover, there are a number of experimental data
suggesting that some allelic differences identified by
electrophoresis are associated with adaptation to
different environments. In Drosophila, for instance,
there is evidence for correlation between in vitro heat
resistance of ADH variants and the temperature
characterizing their geographical origin (58).

c. Balancing Selection

The evolutionary forces described so far generate
transient DSP in the sense that the population
frequencies of existing genetic variants will irrevocably
change with time until either fixation or loss. In some
cases, however, alleles may be maintained in a
population at a steady state level. Overdominance is one
of the mechanisms capable of generating such a "balanced
polymorphism". For a two allele system, this means that
the heterozygous individuals benefit from a selective
advantage compared to both homozygous genotypes. This
is expected to generate a steady state where both
alleles are maintained in the population at respective
equilibrium frequencies p and q, where

$$q = \frac{s}{s + t} \quad \text{and} \quad p = \frac{t}{s + t}$$

s and t being the respective selection coefficients of
the homozygotes.
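
The overdominant equilibrium can be checked numerically. The sketch below assumes the standard parameterization, which the text does not spell out: relative fitnesses 1 - s, 1 and 1 - t for the two homozygotes and the heterozygote, with s acting against the homozygote for the allele of frequency p. All function names are hypothetical.

```python
def equilibrium_frequencies(s: float, t: float) -> tuple[float, float]:
    """Equilibrium allele frequencies (p, q) under overdominance."""
    if s <= 0 or t <= 0:
        raise ValueError("both selection coefficients must be positive")
    p = t / (s + t)
    return p, 1.0 - p

def next_generation(p: float, s: float, t: float) -> float:
    """Allele frequency after one round of selection (assumed fitness model)."""
    q = 1.0 - p
    mean_fitness = p * p * (1 - s) + 2 * p * q + q * q * (1 - t)
    return p * (p * (1 - s) + q) / mean_fitness

def iterate(p0: float, s: float, t: float, generations: int) -> float:
    """Follow the allele frequency over several generations of selection."""
    p = p0
    for _ in range(generations):
        p = next_generation(p, s, t)
    return p
```

Starting from any intermediate frequency, repeated application of next_generation approaches the value returned by equilibrium_frequencies, which illustrates why the polymorphism is described as balanced.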

The best known example of balanced polymorphism due
to overdominance is the maintenance of the S β-globin
allele (causing sickle-cell anemia in the homozygotes)
as well as thalassemia-causing mutants (see, for
instance, 61) in populations subjected to malaria,
because of the resistance exhibited by the heterozygotes
towards the parasite. The high level of polymorphism
observed at the Major Histocompatibility Locus is
thought to result from overdominance selection as well
(62).

Frequency dependent selection may be another cause
of balanced polymorphism, an example being the "rare
mate advantage" observed in Drosophila (63).

II. CONSTRUCTION OF PRIMARY DNA MARKER MAPS 

A. Linkage Strategies 

Two loci are said to be genetically linked if,
during meiosis, they recombine at a rate significantly
lower than 50%, i.e., they generate significantly more
parental gametes than recombinant gametes. The
recombination rate between loci reflects the frequency of
occurrence of an odd number of crossing-overs between
the loci. Because the probability of crossing-over is
proportional to the distance separating the loci, the
recombination rate can be used as a unit of chromosomal
length. This length unit is known as the Morgan (M),
1 cM (centimorgan) corresponding to the distance
separating two loci exhibiting a 1% recombination rate.
For small distances (below about 30 cM), the relation
between centimorgan and recombination rate is essentially
linear. For longer distances, however, the relation is
more complex, depending on the frequency of double
crossing-overs, itself affected by possible interference.

Parental and recombinant gametes will only be
distinguishable for doubly heterozygous individuals,
hence the need for highly polymorphic markers.

Recently, and due to the advent of PCR, it has
been possible to directly determine the genotype of
individual gametes (64). However, most of the time, the
gametic contribution is inferred from the genotype of
the offspring and linkage studies are performed within
families. Most modern linkage studies use the lodscore
test for evaluation of linkage: a sequential test based
on the method of maximum likelihood (65). The lodscore
corresponds to log10(LR), where LR is the ratio of the
likelihood of the observations under the alternative
hypothesis of linkage, θ < 0.5, to the likelihood of the
observations under the null hypothesis of no linkage,
θ = 0.5, θ being the recombination rate. In human
genetics, a lodscore > 3 is accepted as significant
evidence for linkage. The prior probability of
linkage between two loci has been used to justify this
stringent critical value. Note that 2ln(LR) can be used
as well, having a chi-square distribution with one
degree of freedom under the null hypothesis of no
linkage.
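
For the simplest case of fully informative, phase-known meioses, where recombinant and non-recombinant gametes can be counted directly, the lodscore reduces to a binomial likelihood ratio. The following sketch assumes that simplified setting, which is narrower than the general pedigree likelihoods used in practice; all function names are hypothetical.

```python
import math

def lod_score(recombinants: int, meioses: int, theta: float) -> float:
    """log10 likelihood ratio of recombination rate theta versus theta = 0.5."""
    if meioses <= 0 or not 0 <= recombinants <= meioses:
        raise ValueError("need 0 <= recombinants <= meioses and meioses > 0")
    if not 0.0 <= theta <= 0.5:
        raise ValueError("theta must lie between 0 and 0.5")
    non_recombinants = meioses - recombinants
    if theta == 0.0 and recombinants > 0:
        return float("-inf")  # any recombinant excludes complete linkage
    log_alt = 0.0
    if recombinants:
        log_alt += recombinants * math.log10(theta)
    if non_recombinants:
        log_alt += non_recombinants * math.log10(1.0 - theta)
    log_null = meioses * math.log10(0.5)
    return log_alt - log_null

def max_lod_score(recombinants: int, meioses: int) -> tuple[float, float]:
    """Maximum likelihood estimate of theta (capped at 0.5) and its lodscore."""
    theta_hat = min(recombinants / meioses, 0.5)
    return theta_hat, lod_score(recombinants, meioses, theta_hat)

def chi_square_statistic(lod: float) -> float:
    """Convert a lodscore to 2*ln(LR)."""
    return 2.0 * math.log(10.0) * lod
```

Under these assumptions, ten meioses without any recombinant already give a lodscore slightly above 3, and a lodscore of 3 corresponds to a 2ln(LR) value of about 13.8.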

Recently, algorithms for multilocus linkage
analysis have been developed, allowing an estimate of the
most likely gene orders and genetic distances between
several loci simultaneously (66-68).

Although usually determined within families,
genetic linkage can manifest itself at the population
level also: a phenomenon called "linkage
disequilibrium". According to the Hardy-Weinberg law, the
equilibrium genotypic frequencies are reached in a
single generation (except if the initial gene
frequencies are not equal among sexes). For a diallelic
system with alleles a1 and a2, with respective allelic
frequencies p1 and p2, the equilibrium genotypic
frequencies are p1^2, 2p1p2 and p2^2 for a1a1, a1a2 and
a2a2 respectively. This does not necessarily hold when
considering two loci simultaneously. The genotypic
equilibrium frequencies are only reached when the
previous generation produces the four possible gametes
at the expected frequencies: a1b1: p1q1, a2b1: p2q1,
a1b2: p1q2, a2b2: p2q2, where q1 and q2 are the allelic
frequencies at the second locus. The difference between
observed and expected gametic frequencies is called
linkage disequilibrium, D. The value of D is reduced by
D·θ every generation, θ being the recombination rate
between the two loci. For unlinked loci D diminishes by
1/2 every generation; for linked loci, however, the
reduction of D per generation will be much smaller. The
detection of a linkage disequilibrium is an indication of
linkage between the corresponding loci.
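
The per-generation decay of linkage disequilibrium described above amounts to multiplying D by one minus the recombination rate each generation. A minimal sketch, with a hypothetical function name and illustrative starting values:

```python
def disequilibrium_after(d0: float, theta: float, generations: int) -> float:
    """Linkage disequilibrium remaining after a number of generations."""
    if not 0.0 <= theta <= 0.5:
        raise ValueError("recombination rate must lie between 0 and 0.5")
    if generations < 0:
        raise ValueError("generations cannot be negative")
    # Each generation removes a fraction theta of the remaining disequilibrium.
    return d0 * (1.0 - theta) ** generations

unlinked = disequilibrium_after(0.2, 0.5, 1)    # halves in a single generation
linked = disequilibrium_after(0.2, 0.01, 10)    # decays slowly for tight linkage
```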

B. Genetic Maps

Using this linkage approach, combined with
alternative mapping strategies such as "in situ"
hybridization (see, for instance, 69), the use of somatic
cell hybrid panels and radiation hybrid mapping (reviewed
in 70) and comparative mapping (71), the map location of
a large set of DSP can be determined in order to build a
genetic marker map (see, for instance, 72-74). Assuming a
total map length of 30 M as for the human, and a desirable
maximum distance of 20 cM between markers, a set of 150
DSP could cover the entire genome. However, many more
markers will be needed to generate reasonable maps for
our domestic species, essentially for two reasons.
First, most of the time we have no a priori information
on the location of the characterized markers. Hence,
some chromosomal regions will initially be
overrepresented in our map, others underrepresented.
This problem is expected to become critical in the later
stages of the development of a map. Comparative data
will then become critical, making it possible to search
for markers whose location can be predicted from other
species. Second, an individual will only be informative
for the markers for which it is heterozygous; parts of
its genome will thus not be explorable, because it will
be homozygous for the corresponding markers. To
compensate for this, one will have to identify more
markers, the number required being inversely proportional
to their heterozygosity; hence the importance of highly
informative systems.
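
The marker count quoted above can be reproduced with a short calculation, reading the total map length as 30 Morgans (3,000 cM), which is the value implied by 150 markers at 20 cM spacing. The scaling by heterozygosity is a rough illustration of the inverse proportionality mentioned in the text, not a formula given by the authors; the function name is hypothetical.

```python
import math

def markers_needed(map_length_cm: float, spacing_cm: float,
                   heterozygosity: float = 1.0) -> int:
    """Evenly spaced markers needed, inflated by 1 / heterozygosity."""
    if map_length_cm <= 0 or spacing_cm <= 0:
        raise ValueError("map length and spacing must be positive")
    if not 0.0 < heterozygosity <= 1.0:
        raise ValueError("heterozygosity must lie in (0, 1]")
    evenly_spaced = map_length_cm / spacing_cm
    return math.ceil(evenly_spaced / heterozygosity)

ideal = markers_needed(3000, 20)           # fully informative, evenly spaced markers
realistic = markers_needed(3000, 20, 0.5)  # assumed heterozygosity of 0.5
```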

Once such a map is available, however, any gene for
which the appropriate segregating family material is
available can be located on the map. Assuming a maximum
marker-target gene distance of 10 cM, the expected
lodscore for doubly informative, phase-known meioses
approximates 0.16 (75). Therefore, 20 such meioses are
theoretically sufficient to establish linkage with a
lodscore of 3. In practice, however, the number of
individuals to analyze will be higher, a function among
other factors of the quality of the marker, expressed as
its Polymorphism Information Content (76).
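
The figure of about 0.16 per meiosis can be recovered from the expected lodscore of a single phase-known, fully informative meiosis evaluated at the true recombination rate. This is a standard textbook expression used here as an assumption, since the text only cites the value; the function names are hypothetical.

```python
import math

def expected_lod_per_meiosis(theta: float) -> float:
    """Expected lodscore contributed by one phase-known informative meiosis."""
    if not 0.0 <= theta <= 0.5:
        raise ValueError("theta must lie between 0 and 0.5")
    if theta == 0.0:
        return math.log10(2.0)
    # Recombinant with probability theta, non-recombinant with probability 1 - theta.
    return (theta * math.log10(2.0 * theta)
            + (1.0 - theta) * math.log10(2.0 * (1.0 - theta)))

def meioses_needed(target_lod: float, theta: float) -> int:
    """Smallest number of meioses whose expected total lodscore reaches the target."""
    per_meiosis = expected_lod_per_meiosis(theta)
    if per_meiosis <= 0.0:
        raise ValueError("no linkage information is expected at this theta")
    return math.ceil(target_lod / per_meiosis)
```

At a recombination rate of 0.1 (10 cM), the expected contribution is close to 0.16 and about 19 meioses reach an expected lodscore of 3, consistent with the 20 meioses quoted.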

The efficiency of this approach has been
illustrated by the recent mapping of a large number of
genes involved in human single gene disorders (see, for
instance, 77, 135). The identification of DNA markers
for a defined gene can be the first step towards its
molecular cloning. Successful "positional cloning", or
the isolation of a gene based on its map location, has
been achieved in the human for Chronic Granulomatous
Disease, Duchenne Muscular Dystrophy, Retinoblastoma,
Wilms Tumor, Cystic Fibrosis (134), Type-1
Neurofibromatosis and the Testis Determining Factor.

In domestic animals, genetic maps could be used to
localize the genes underlying production traits,
allowing for Marker Assisted Selection, and providing a
first step towards their isolation, the understanding of
their mechanism of action and their manipulation by
mutagenesis and gene transfer methods. Several
laboratories around the world are now involved in the
development of markers and the construction of genetic
maps for our main domestic species, especially cattle,
pigs and poultry.

III. GENETIC MAPPING OF QUANTITATIVE TRAIT LOCI 

The majority of traits dealt with in animal
production are so-called quantitative traits,
characterized by continuous variation. The phenotype of
an animal with respect to a particular trait is the
result of the effect of several "polygenes" known as
Quantitative Trait Loci, or QTL, combined with
environmental effects. The number of polygenes involved
is essentially unknown. Classically, it is considered
very large, each gene contributing a very small part of
the genetic variation. However, there is evidence both
from the plant world and the animal world that QTL with
significant effects are common (78, 79). The most likely
model is to assume that there is indeed a large number of
genes involved, but that there is a broad distribution of
effects, substantial in some cases. Polygenes with
extreme effects, whose segregation in a population may
cause skewness and bi- or trimodality, are known as
"major genes". Examples in animal breeding are "double
muscling" genes in both cattle and pigs, the "White
Shorthorn" gene involved in the determinism of "White
Heifer Disease" and the "Booroola" fertility gene in
sheep (80). Even with significant effects on the trait of
interest, however, their contribution to the total
genetic variation may be limited in case of low
population frequency.

When dealing with quantitative traits, direct
determination of genotype for the corresponding QTL is
impossible. Nevertheless, strategies have been designed
to map QTL by linkage analysis. Within segregating
populations, which is usually the case for our domestic
species, QTL mapping can be performed both within
families and at the population level.

A. QTL Mapping Within Families

Traditionally one proceeds as follows: offspring
from an individual heterozygous for both marker and QTL
are grouped according to which allele at the marker
locus they inherited; a statistically significant
difference between the phenotypic means of the two
groups indicates linkage between marker and QTL. The test
for statistical significance is done by linear regression
(i.e. one-way analysis of variance) under the
assumption of normally distributed residual environmental
variance. Classically, markers are tested one at
a time for possible linkage with a QTL affecting the
trait of interest. One of the drawbacks of this approach
is that it is impossible to unequivocally estimate both
the map location of the QTL with respect to the marker and
its effect on the considered trait; no distinction can
be made between a closely linked QTL with small effect
and a loosely linked QTL with major effect.
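By way of illustration only, the confounding of location and effect in single-marker analysis can be sketched numerically. The sketch assumes a heterozygous parent of known phase, so that an offspring inheriting a given marker allele carries the linked QTL allele with probability 1-θ, θ being the recombination rate. The function name and the numerical values are hypothetical and are not taken from the cited work.

```python
def expected_marker_contrast(qtl_effect, theta):
    """Expected difference between the means of the two marker-allele groups.

    qtl_effect: difference between the two QTL alleles of the heterozygous parent.
    theta: recombination rate between marker and QTL (0 to 0.5).
    """
    if not 0.0 <= theta <= 0.5:
        raise ValueError("theta must lie between 0 and 0.5")
    # A fraction theta of each marker group carries the other QTL allele,
    # so the observed contrast shrinks by the factor (1 - 2*theta).
    return (1.0 - 2.0 * theta) * qtl_effect


# Hypothetical values: a closely linked QTL of small effect ...
close_small = expected_marker_contrast(qtl_effect=250.0, theta=0.10)
# ... and a loosely linked QTL of large effect.
loose_large = expected_marker_contrast(qtl_effect=500.0, theta=0.30)
print(close_small, loose_large)
```

Under these assumptions both configurations yield essentially the same expected contrast between marker groups, which is why a single marker cannot separate the two situations.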

Recently, the lodscore method has been improved,
making it possible to deal with quantitative and other
complex traits and to fully exploit the power of the
nearly complete marker maps which have become available
for different organisms. This approach is known as
interval mapping. Not only does interval mapping solve
the problem of simultaneous estimation of location and
effect, but because of its increased power, it reduces
the number of individuals to be tested to detect linkage
with a QTL of given effect (81).

Assuming that the marker is the QTL, the number of
individuals to test in order to detect an effect of
given amplitude, δ, can be estimated from:

$$n \geq 2\,(t_0 + t_1)^2\,\frac{s^2}{\delta^2}$$

where n gives the required sample size, s² is an estimate
of the residual variance, t₀ is the t value associated
with Type I error, and t₁ is the t value associated with
Type II error; t₁ equals tabulated t for probability 2(1-P),
where P is the required probability of detecting δ if
such a difference exists (82).

For dairy production for instance, if the linkage
analysis is performed using the "daughter yield
deviations" (DYD; σ ≈ 600 lb) from paternal half-sibs
("granddaughter design") (83), one would have to study
1,500, 378 and 168 individuals, respectively, to detect
QTL with differences of 200 lb, 400 lb and 600 lb
between alternate alleles. Assuming a phenotypic variance
of (2500 lb)², such effects correspond to 0.08, 0.16 and
0.24 standard deviations, respectively. These estimates
assume a Type I error of 5%, a Type II error of 10%, and
absence of recombination between marker and QTL.
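As a rough, non-authoritative check of the orders of magnitude quoted above, the sample size calculation can be sketched as follows. The sketch uses large-sample normal quantiles in place of tabulated t values. It further assumes, which the text does not state, that the contrast observed between the two groups of sons in DYD units is half the difference between alternate alleles, and that the quoted figures count both groups together. All function and parameter names are hypothetical.

```python
from statistics import NormalDist


def n_per_group(delta, s, alpha=0.05, power=0.90):
    """Individuals per marker group needed to detect a contrast delta.

    Implements n >= 2 (t0 + t1)^2 s^2 / delta^2 with normal quantiles
    standing in for the tabulated t values (large-sample approximation).
    """
    z = NormalDist()
    t0 = z.inv_cdf(1.0 - alpha / 2.0)  # two-sided Type I error
    t1 = z.inv_cdf(power)              # equivalent to t for probability 2(1-P)
    return 2.0 * (t0 + t1) ** 2 * (s / delta) ** 2


def total_sons_granddaughter(allelic_difference, sigma_dyd=600.0):
    """Total sons over both marker groups.

    Assumption for illustration: the DYD contrast equals half the
    difference between alternate alleles.
    """
    return 2.0 * n_per_group(allelic_difference / 2.0, sigma_dyd)


for diff in (200.0, 400.0, 600.0):
    print(diff, round(total_sons_granddaughter(diff)))
```

With these assumptions the sketch returns totals close to the 1,500, 378 and 168 individuals quoted in the text; the agreement suggests, but does not establish, that the original estimates were derived along similar lines.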

If the tested marker and the QTL recombine at a
rate θ, the number of individuals to test increases by
a factor 1/(1-2θ)² for single marker analysis, and by a
factor ≈ (1-r)/(1-2θ)² in the case of interval mapping, r
corresponding to the recombination rate between the
flanking markers (81).
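The two inflation factors given above can be evaluated directly. The following minimal sketch simply transcribes them; the function names and the example values of θ and r are hypothetical.

```python
def single_marker_inflation(theta):
    """Factor 1/(1-2*theta)^2 by which the sample size grows."""
    if not 0.0 <= theta < 0.5:
        raise ValueError("theta must satisfy 0 <= theta < 0.5")
    return 1.0 / (1.0 - 2.0 * theta) ** 2


def interval_mapping_inflation(theta, r):
    """Approximate factor (1-r)/(1-2*theta)^2 for interval mapping,
    r being the recombination rate between the flanking markers."""
    if not 0.0 <= r < 0.5:
        raise ValueError("r must satisfy 0 <= r < 0.5")
    return (1.0 - r) * single_marker_inflation(theta)


# Hypothetical example: theta = 0.10, flanking markers with r = 0.20.
print(single_marker_inflation(0.10), interval_mapping_inflation(0.10, 0.20))
```

For any r greater than zero the interval mapping factor is smaller than the single-marker factor, which reflects the gain in power attributed to interval mapping.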

In view of the costs and time involved in genotyping,
it is important to minimize the required sample
size. This can be achieved in various ways:

a. Identification of the Individuals Most Likely to
be Heterozygous, hence Informative, for the Studied QTLs

One way to achieve this is to cross strains that are highly
divergent for the trait of interest. In plant
breeding, where the use of exotic germplasm is common
practice, this is perfectly applicable. The
identification of markers for interesting QTLs from the
exotic strains can then be used for their marker
assisted introgression into the commercial varieties.

In animal breeding, however, introgression programs
are very uncommon. With "Velogenetics" (described
further below), however, the use of exotic germplasm in
introgression programs may become more attractive for
animal breeders as well.

An alternative approach is to identify the individuals
whose offspring show a higher variance
for the trait of interest.

b. Selective Genotyping of the Extreme Progeny

As pointed out by Lander and Botstein (75), the
individuals whose genotype can be most clearly inferred
from their phenotype are the ones providing most of the
linkage information when studying complex traits. For
quantitative traits, these are the individuals whose
phenotypic value deviates most from the mean: the tails
of the distribution. Sample sizes could be reduced by
60% and even 80% by focusing on individuals deviating
one and two standard deviations, respectively, from the
mean. Paradoxically, selective genotyping may be limited
by the size of the studied population. Indeed, a larger
sample will be required in order to find enough individuals
one or two standard deviations from the mean.
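The limitation noted above can be illustrated with a short calculation. The sketch assumes a normally distributed trait and two-sided selection of the tails; the function names and the figure of 100 genotyped animals are hypothetical.

```python
import math
from statistics import NormalDist


def tail_fraction(k):
    """Fraction of a normal population deviating more than k standard
    deviations from the mean, counting both tails."""
    return 2.0 * (1.0 - NormalDist().cdf(k))


def animals_to_phenotype(n_genotyped, k):
    """Number of animals to phenotype in order to find n_genotyped
    individuals beyond k standard deviations (expected value, rounded up)."""
    return math.ceil(n_genotyped / tail_fraction(k))


for k in (1.0, 2.0):
    print(k, tail_fraction(k), animals_to_phenotype(100, k))
```

Roughly one animal in three lies beyond one standard deviation and fewer than one in twenty beyond two, so the saving in genotyping is paid for by a much larger phenotyped population.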

c. Decreasing Environmental Variance via Progeny 
Testing 

Weller et al. (83) have tested the effect of
progeny testing to reduce the environmental variance by
comparing the power of "daughter" and "granddaughter"
designs for the detection of QTLs in dairy cattle. In
the "daughter" design, marker genotype and quantitative
trait values are assessed on daughters of heterozygous
sires, while in the "granddaughter" design, marker
genotypes are determined on sons of heterozygous sires,
their breeding values being determined by progeny
testing from the quantitative trait value measured on
their daughters. They demonstrate that for equal power
the "granddaughter" design requires half as many marker
assays as the "daughter" design.



d. Reducing genetic noise by searching for several 
unlinked QTL simultaneously, or "simultaneous" search 
(81). 



e. Using DNA Pools 

Instead of genotyping all individuals separately,
one can analyze DNA pools from individuals sorted by
phenotype. Significant differences in allelic frequencies
between pools point towards possible genetic
linkage between the corresponding marker locus and a
gene or genes affecting the trait of interest. This
approach can be used both for "within family" studies
and for studies at the population level. The latter
approach, however, requires linkage disequilibrium
between QTL and marker locus. This method was first
described by Arnheim et al. (84) to study the role of
HLA class II loci in insulin-dependent diabetes
mellitus. It was recently adopted by Plotsky et al.
(85) to study the association between DNA fingerprint bands
and abdominal fat deposition in broilers.
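One simple way to compare allelic frequencies between two pools is a Pearson chi-square test on allele counts. The sketch below is an illustration under stated assumptions and is not the procedure of the cited studies: it assumes that allele counts (or estimates thereof) are available for a "high" and a "low" phenotype pool, and the counts shown are invented.

```python
def pool_chi_square(high_counts, low_counts):
    """Pearson chi-square statistic for a 2 x k table of allele counts
    (high-phenotype pool versus low-phenotype pool)."""
    if len(high_counts) != len(low_counts):
        raise ValueError("both pools must list the same alleles")
    total_high = sum(high_counts)
    total_low = sum(low_counts)
    grand_total = total_high + total_low
    chi2 = 0.0
    for high, low in zip(high_counts, low_counts):
        allele_total = high + low
        if allele_total == 0:
            continue  # allele absent from both pools
        for observed, pool_total in ((high, total_high), (low, total_low)):
            expected = pool_total * allele_total / grand_total
            chi2 += (observed - expected) ** 2 / expected
    return chi2


# Invented counts for a diallelic marker in two pools of 50 animals each.
statistic = pool_chi_square([70, 30], [30, 70])
print(statistic)  # compare with 3.84, the 5% threshold for 1 degree of freedom
```

The statistic has k-1 degrees of freedom for k alleles. In practice pooled DNA yields band intensities rather than counts, so the counts used here would themselves have to be estimated.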

f. Exploiting "Tagged QTLs"

The direct effect of selection for a production 
trait will be to increase the frequency of the favorable 
alleles at the segregating QTLs. However, this selection 
pressure may indirectly affect loci in linkage
disequilibrium by so-called "hitch-hiking".

This is probably what happened to the genetic
defect causing progressive degenerative myeloencephalopathy,
or Weaver, in Brown Swiss, shown to be linked to
a major gene for milk production. Because of the
deleterious effect of the Weaver-causing gene, it is the
heterozygous "carrier" genotype which is selectively
most advantageous, generating a "balanced polymorphism",
the Weaver-causing allele being maintained in the
population at a relatively high frequency. This can be
exploited, however, to map the corresponding QTL by
going through the relatively easy exercise (compared to
QTL mapping) of finding a marker linked to this single
gene disorder. We have recently identified a marker
linked to Weaver and presumably to the associated QTL.

Besides Weaver, QTLs for a variety of polygenic
traits have been identified, both in plants and animals.
Using complete DSP maps in tomato, Paterson et al. (78)
identified at least six genes controlling fruit mass,
four controlling soluble solids, and five controlling
fruit pH, accounting for 58%, 44% and 48%, respectively,
of the phenotypic variance. Martin et al. (79), using
a similar approach, identified at least three tomato
genes controlling water use efficiency. In cattle,
Geldermann et al. (86) found significant effects on milk
yield (+ 200 kgs) and fat content (+ 1%), especially for
the β-lactoglobulin locus. More recently, Cowan et al.
(87) demonstrated significant effects on milk production
traits using a prolactin DNA Sequence Polymorphism as
marker.

B. QTL Mapping Within Populations

One can expect to find an effect of marker alleles
linked to QTLs also outside of a family context, i.e.,
at the population level, if the two loci are in linkage
disequilibrium. As reported by Hanset (88), and assuming
a diallelic marker (alleles M1 and M2 with respective
frequencies p1 and p2) linked to a diallelic QTL, the
phenotypic difference between the respective homozygotes
at the marker locus is:

$$\delta = 2a \cdot \frac{D}{p_1\,p_2}$$

with D measuring the linkage disequilibrium and 2a
corresponding to the phenotypic difference between the two
homozygotes for the QTL.
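The relation between linkage disequilibrium and the contrast between marker homozygotes can be checked numerically. The sketch below assumes an additive diallelic QTL (genotypic values +a, 0 and -a), random mating, and D defined as the excess frequency of the M1-Q1 haplotype over the product of the allele frequencies; the function names and numerical values are hypothetical.

```python
def marker_homozygote_contrast(a, d, p1):
    """Contrast between M1M1 and M2M2 homozygotes: 2a * D / (p1 * p2)."""
    p2 = 1.0 - p1
    return 2.0 * a * d / (p1 * p2)


def contrast_from_haplotypes(a, d, p1, q1):
    """Same contrast, derived from the frequency of QTL allele Q1
    among M1-bearing and M2-bearing chromosomes."""
    p2 = 1.0 - p1
    q1_given_m1 = q1 + d / p1  # frequency of Q1 on M1 chromosomes
    q1_given_m2 = q1 - d / p2  # frequency of Q1 on M2 chromosomes
    mean_m1m1 = a * (2.0 * q1_given_m1 - 1.0)
    mean_m2m2 = a * (2.0 * q1_given_m2 - 1.0)
    return mean_m1m1 - mean_m2m2


# Hypothetical values: a = 100, D = 0.05, p1 = 0.4, q1 = 0.3.
direct = marker_homozygote_contrast(100.0, 0.05, 0.4)
derived = contrast_from_haplotypes(100.0, 0.05, 0.4, 0.3)
print(direct, derived)
```

Both routes give the same value, and the contrast vanishes when D is zero, i.e. when marker and QTL are in linkage equilibrium at the population level.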

Markers for which a priori evidence for linkage
disequilibrium is highest are the so-called "candidate
genes": genes expected from their physiological role to
be likely candidates for the QTL itself. DSPs at those
loci, even if selectively neutral by themselves, can be
expected to exhibit linkage disequilibrium with the
hypothetical functional mutations because of their very
tight linkage. As an example, the B allele of the
κ-casein gene has been shown in several studies to
increase protein yield in milk by about 3%, and possibly
to improve cheese yield independent of the effect on
protein yield (see, for instance, 89, 90).

IV. USE OF DNA MARKERS IN BREEDING PROGRAMS

In classical selection programs, breeding values
are estimated from individuals' own performances and
performances of relatives (136). The expected genetic
progress is a function of the accuracy of selection,
i.e. the correlation between estimated and true breeding
values. All direct information on QTL can be used to
increase the accuracy of selection and, hence, genetic
response. Early on, Soller and Beckmann (91) proposed
to exploit marker information for the preselection of
young dairy sires before progeny-test. In cattle,
Marker Assisted Selection is already used for the sexing
of preimplantation embryos using Y-specific probes (see,
for instance, 92), and for genotyping at the κ-casein
(see, for instance, 93) and prolactin loci (87). In
pigs, Marker Assisted Selection is used to reduce the
frequency of the major gene causing Porcine Stress
Syndrome (PSS). Susceptibility to PSS correlates with
Halothane sensitivity or Malignant Hyperthermia. This
condition has been mapped to a linkage group on pig
chromosome 6, encompassing the following markers:
S(A-O)-GPI-Hal-H-A1BG-PGD (reviewed by 94). These markers
are used for Marker Assisted Selection against the
PSS condition. Recently, the ryanodine receptor gene
has been identified as a good candidate for the Malignant
Hyperthermia or Hal gene (95).

As shown by Smith and Simpson (96), the gain to be
made with Marker Assisted Selection increases with the
proportion of QTL identified and is highest for low
heritability traits. Unfortunately, the QTL determining
the latter traits are also the hardest ones to identify.
It should be noted that the increase in accuracy
depends on the accurate estimation of the QTL
effects. This may require larger samples than the ones
needed for the detection of linkage. Once a QTL has been
mapped by within-family linkage studies, it may be more
effective to identify supplementary flanking markers and to
accurately determine the effect of the generated haplotypes
at the population level. Selection can then focus
on the best haplotype instead of spending initial selection
efforts on intermediate ones.

The use of genetic markers in selection programs
may also reveal dominance deviation (particularly
overdominance) and interaction deviation at defined QTL,
variance components poorly dealt with in classical
breeding theory. Specific programs may be required to
fully exploit these QTL. In the case of overdominance
for instance, two lines each homozygous for the different
alleles at each QTL could be developed and crossed
to produce multiple heterozygotes.

There is widespread interest in resolving
quantitative traits into their Mendelian components by
mapping the underlying QTL. The implementation of marker
assisted selection into breeding schemes, however, has
not always been received with enthusiasm. Part of the
skepticism expresses the doubt that the genetic gains
obtainable by marker assisted selection will justify
expensive and tedious large scale genotyping. Although
the costs of genotyping will drop substantially in the
near future, due to the rapid pace at which automation
and robotics are being applied to DNA technology, this
objection remains very pertinent.

Another major limitation of marker assisted
selection in its present form is its restriction to
the exploitation of genetic variation preexisting within
the commercial breed of interest, and only that present
in a "high merit" genetic background. Favorable
mutations appearing within a mediocre background, or
present in "exotic" germplasm, would be difficult to
exploit, even with markers.

We have therefore proposed a scheme, designated
"Velogenetics", combining marker assisted introgression
and germ-line manipulations to reduce the generation
interval, which might drastically increase the power of
marker assisted selection (141).

V. INDIVIDUAL IDENTIFICATION AND PATERNITY DIAGNOSIS:

Methods to estimate the breeding value of an animal 
use information from relatives. As a matter of fact, 
keeping track of familial relationships has always been 
one of the major concerns of animal breeders, and 
parentage control is now a widely used procedure for 
several domestic species. Parentage control relies on 
the use of polymorphic systems within the studied 
population. The alleles that characterize an individual 
originate from the mother or the father. If one of the 
parents is known (usually the mother) , the alleles 
necessarily transmitted by the other parent can be
deduced easily. Paternity testing consists of scoring
the presence or absence of those obligate paternal alleles
in the genotype of the putative parent. Lack of one or
more of these alleles points towards incorrectly
assigned paternity. If, on the contrary, all obligate
paternal alleles are present in the tested parent, there
is no evidence for incorrectly assigned paternity.
Nevertheless, one always has to consider the possibility
of fortuitous coincidence. The higher the variation of
the genetic markers used, the higher the probability of
detecting incorrectly assigned paternity, and thus the
higher the "exclusion power".
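The exclusion logic described above can be sketched for codominant markers as follows. This is a minimal illustration, assuming that the dam is known, that genotypes are given as pairs of allele labels, and that typing errors and mutations are ignored; the function names and genotypes are hypothetical.

```python
def obligate_paternal_alleles(offspring, dam):
    """Alleles of which the sire must carry at least one, at one locus."""
    child = set(offspring)
    if not child & set(dam):
        raise ValueError("offspring shares no allele with the dam at this locus")
    not_from_dam = child - set(dam)
    # If one allele cannot come from the dam, it is the obligate paternal
    # allele; otherwise either offspring allele may be paternal.
    return not_from_dam if not_from_dam else child


def sire_excluded(offspring, dam, sire):
    """True when the putative sire carries none of the obligate alleles."""
    return not (obligate_paternal_alleles(offspring, dam) & set(sire))


def count_exclusions(loci):
    """Number of loci, given as (offspring, dam, sire) genotypes,
    at which the putative sire is excluded."""
    return sum(sire_excluded(o, d, s) for o, d, s in loci)


# Hypothetical genotypes at two loci.
loci = [((1, 3), (1, 2), (2, 2)),   # obligate allele 3 is absent: exclusion
        ((1, 2), (1, 2), (2, 3))]   # allele 2 is present: no exclusion
print(count_exclusions(loci))
```

A single exclusion already points towards incorrectly assigned paternity, whereas the absence of any exclusion is only as convincing as the exclusion power of the markers used.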

Until now, the systems most often used for
paternity testing were blood group systems, biochemical
polymorphisms, or the major histocompatibility system.
The availability of DSP, however, opens new perspectives
for paternity diagnosis. Hypervariable minisatellites
in particular, characterized by their remarkably high
degree of polymorphism, have proven especially useful in
this respect. Multilocus DNA fingerprints, based on the
simultaneous detection of related minisatellite loci,
have been shown to be extremely powerful for paternity
diagnosis, both in humans (19) and animals (108, 109).
Exclusion powers as high as 99.999996% have been
obtained with as few as 2 probes in humans (19).
With such high exclusion powers, absence of exclusion
can be considered proof of true biological parentage.
Another corollary is that very high exclusion powers can
be obtained even when a single parent is available and
tested for parenthood. Multilocus DNA fingerprints,
however, tend to be replaced by the combined use of a
limited number of locus-specific VNTR markers (20),
giving equally powerful, but more reproducible,
sensitive and easily interpretable patterns. With the
advent of locus-specific VNTRs and PCR-amplifiable
microsatellites in animal species (44), the same will
probably hold in this field too.

Along the same lines, DNA markers can also be used
for individual identification. Using expansion-contraction
type polymorphisms, individual specific "DNA
bar codes" can literally be generated (19, 110).

SUMMARY OF THE INVENTION

Disclosed herein is a set of locus-specific genetic
markers for domestic cattle and related bovids that
constitute a primary bovine DNA marker map. Among other
applications, these markers and the map are useful for:

- individual identification,
- parentage testing,
- the genetic mapping of economic trait loci, or
  genes involved in the determinism of economically
  important traits, whether single gene traits or
  complex multifactorial traits,
- marker assisted selection,
- velogenetics, or the synergistic use of marker
  assisted introgression and germ-line manipulations
  to reduce the generation interval.

The usefulness of this set of markers for the
genetic mapping of economic trait loci is illustrated by
the identification of a genetic marker for bovine
progressive degenerative myeloencephalopathy or
"Weaver" in the Brown Swiss breed.

BRIEF DESCRIPTION OF THE FIGURES

Figure 1: Typical VNTR pattern obtained
with probe GMBT-005, using HaeIII.

Figure 2: Example of a microsatellite pattern
(TGLA9).

Figure 3: Schematic representation of
"Velogenetics".

DETAILED DESCRIPTION OF THE INVENTION

I. CONSTRUCTION OF A PRIMARY BOVINE DNA MARKER MAP: 

Our laboratory has focused in the last two years on the
development of a primary DNA marker map for cattle. We
have now developed more than 300 highly polymorphic DNA
markers of three types:

1. Variable Number of Tandem Repeat Markers (VNTR)

Hypervariable minisatellites are known to show
significant cross-hybridization between species (31, 44,
110). We have exploited this to isolate bovine VNTRs
using heterologous minisatellite probes. Screening
purpose-built libraries with minisatellite probes, we
have isolated 36 bovine VNTRs, characterized by a mean
heterozygosity of 59.3% within the American Holstein
breed. Matching probabilities and exclusion powers were
estimated by Monte-Carlo simulation, showing that the
top 5 to 10 probes could be used as a very efficient
DNA-based system for individual identification and
paternity diagnosis. The isolated VNTR systems should
contribute significantly to the establishment of a
bovine primary DNA marker map. Linkage analysis, use of
somatic cell hybrids and in situ hybridization
demonstrate that these bovine VNTRs are organized as
clusters, scattered throughout the bovine genome,
without evidence for preterminal confinement as in the
human (35). Moreover, Southern blot analysis and in
situ hybridization demonstrate conservation of sequence
and map location, respectively, of minisatellites within
Bovidae. A typical VNTR pattern obtained with one of
our probes is shown in Figure 1. A detailed description
of our VNTR systems is reported in "Example 1".

2. Multisite haplotypes

We used 110 random cosmids to probe Southern blots
of 9 unrelated cattle DNAs digested with 12 restriction
enzymes. Although only one third of the expected
fragments could be detected, 85% of the cosmids revealed
at least one polymorphism. The mean heterozygosity of
the generated multisite haplotypes (98) was estimated at
51.9%. A surprisingly high proportion of polymorphisms
(≈25%) was attributed to insertion-deletion events,
compensating for the lower level of nucleotide
diversity, π, observed in cattle (π ≈ 0.0007) as
compared to the human. The mutation rate at cytosines
in the CpG dinucleotide was estimated to be approximately 10
times higher than at other nucleotides. The
generated markers should cover approximately 40% of the
bovine genome when used in linkage studies. A detailed
description of our multisite haplotypes is reported in
"Example 2".

3. Microsatellites 

Recently, microsatellites were proven to be an
abundant source of highly polymorphic markers in the
human (32-34). As their name implies, microsatellites
are minute VNTR markers (18-20), characterized by tandem
repetitions of very short repeats, one to four base
pairs in length. Microsatellites exhibit levels of
polymorphism comparable to VNTRs, but are much more
abundant and apparently evenly spread throughout the
genome. We have estimated the frequency of (CA)-dinucleotide
repeats in the bovine genome at > 150,000.
Because of their small size, their detection is greatly
facilitated by PCR. Although this requires the
preliminary determination of flanking DNA sequences to
design the appropriate primers, the subsequent PCR
reaction used for their analysis offers several
advantages over Southern blotting, being fast, requiring
less DNA and being easier to automate.

As part of our effort to build a primary DNA marker
map for cattle, we have isolated more than 250 bovine
microsatellites, amplified most of them in vitro and
shown that the majority of them are indeed polymorphic
in cattle. Several of these have been tentatively
assigned to specific bovine chromosomes using a somatic
cell hybrid panel. Moreover, we have shown that
approximately 50% of the bovine microsatellites can be
successfully used in other Bovidae as well, which will
greatly facilitate the construction of marker maps in
these species.

Magnetic solid phase DNA sequencing procedures
(137) are used for the massive generation of sequence
information, and multiplex approaches for genotype
collection, based on the simultaneous detection of
molecules labelled with different fluorescent dyes using
a laser-excited confocal fluorescence gel scanner (139).

A typical microsatellite pattern is shown in Figure
2. A detailed description of our microsatellites is
reported in "Example 3".

The relative location of the markers was determined
by linkage analysis in pedigrees generated by multiple
ovulation and embryo transfer. To assign linkage groups
to specific chromosomes, highly polymorphic "anchor
markers" were mapped using somatic cell hybrids (Jim
Womack, Texas A&M), and by in situ hybridization (Rudy
Fries, ETH - Zurich).

Linkage analysis involving 150 of these markers
generated a primary DNA marker map with 24 linkage
groups counting two or more markers (15 assigned to

specific chromosomes or synteny groups) , and €8 
singleton markers. A detailed description of our primary
bovine DNA marker map is reported in "Example 4".

II. MICROSATELLITE MAPPING OF A MAJOR GENE FOR MILK
PRODUCTION, LINKED TO BOVINE PROGRESSIVE DEGENERATIVE
MYELOENCEPHALOPATHY OR WEAVER.

Identifying polygenes requires the analysis of
pedigrees of considerable size, despite the development
of procedures such as interval mapping, simultaneous
search, selective genotyping, etc. In this work we have
explored an alternative approach to map a polygene,
exploiting the association observed in cattle between
the single gene disorder "Weaver" and increased milk
production. Weaver, or bovine progressive degenerative
myeloencephalopathy, is a recessive disorder
characterized by the appearance between 5 and 8 months
of age of bilateral hind leg weakness and ataxia with
deficient proprioceptive reflexes, without skeletal or
muscular defects. Estimates of gene frequency in the
American Brown Swiss breed point towards the maintenance
of the Weaver gene at relatively high frequency (>5%),
despite the implementation of programs for detection and
elimination of carrier bulls. Moreover, Hoeschele and
Meinert (140) showed that Weaver carrier animals have an
advantage of 690.8 kgs milk (> 0.25 phenotypic σ) above
the mean. Both observations could be accounted for by
the presence of a gene with major effect on milk yield
in linkage disequilibrium with the "Weaver" gene.

Brown Swiss animals showing symptoms of Weaver were
identified with the help of the American Brown Swiss
Association. Blood samples were collected from the
affected animals, their parents, and full-siblings when
available. Diagnosis of Weaver was confirmed in most
cases by anatomopathological examination of spinal cord
and cerebellum at the Department of Pathology of the
College of Veterinary Medicine, Kansas State University.
Shrunken Purkinje cells in the cerebellum, spheroids and
degenerated myelin sheaths in the spinal cord were
considered pathognomonic. Altogether, 78 animals were
identified, generating a single, large pedigree. All
animals were genotyped for more than 70 genetic markers:
40 Variable Number of Tandem Repeat markers and more
than 30 Microsatellites. Linkage analysis was performed
using the "LINKAGE" programs (60). The microsatellite
marker TGLA116 gave a highly significant lodscore
of 6.5 for a recombination rate of 7.5%. Although the a
priori probability for pair-wise linkage is unknown in
cattle, a lodscore of 3 is generally considered to be
the threshold for statistical significance, as in the
human. This value (5.8) was obtained assuming complete
penetrance. Actual penetrance for the Weaver condition
is unknown. However, because our pedigree was
constructed by sampling clearly affected animals, the
assumption of complete penetrance is very reasonable in
this situation.
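For readers unfamiliar with lodscores, the quantity can be illustrated in the simplest possible setting. The sketch below assumes phase-known, fully informative meioses, which is a strong simplification of what the LINKAGE programs compute on a general pedigree; the counts used are hypothetical and are not the data of the Weaver pedigree.

```python
import math


def lod_score(recombinants, meioses, theta):
    """log10 of the likelihood ratio L(theta) / L(0.5) for phase-known meioses."""
    if not 0.0 <= theta <= 0.5:
        raise ValueError("theta must lie between 0 and 0.5")
    non_recombinants = meioses - recombinants
    if theta == 0.0:
        # Any observed recombinant is impossible under theta = 0.
        if recombinants > 0:
            return float("-inf")
        return meioses * math.log10(2.0)
    log_l_theta = (recombinants * math.log10(theta)
                   + non_recombinants * math.log10(1.0 - theta))
    return log_l_theta - meioses * math.log10(0.5)


def max_lod(recombinants, meioses, steps=500):
    """Grid search for the recombination rate maximizing the lodscore."""
    thetas = [i * 0.5 / steps for i in range(steps + 1)]
    best = max(thetas, key=lambda t: lod_score(recombinants, meioses, t))
    return best, lod_score(recombinants, meioses, best)


# Hypothetical data: 2 recombinants among 20 informative meioses.
best_theta, best_lod = max_lod(2, 20)
print(best_theta, best_lod)
```

In this simplified setting the lodscore peaks at the observed recombinant fraction, and a maximum above 3 would conventionally be taken as significant evidence for linkage.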

The marker TGLA116 is characterized by three
alleles segregating in our Weaver pedigree. 72% of the
affected individuals were of the 3/3 genotype, 16% of
the 2/3 genotype, and 12% of the 1/3 genotype. Hence,
at least in our family material, the "Weaver" allele
was clearly associated with allele 3 at the marker
locus. Whether similar disequilibrium will be observed
at the population level remains to be determined. The
reported lodscore values were obtained using allelic
frequencies estimated on a sample of 135 sires from the
American Brown Swiss breed.

Because of the biased sampling procedure used to
generate the pedigree, markers showing distorted segregation
could generate erroneous evidence for linkage with
the disease. A "control" pedigree, consisting of more
than 100 Weaver-free Holstein individuals, was therefore
typed for TGLA116 as well. The microsatellite marker
was characterized in this pedigree by the same three
alleles, with respective frequencies of 18%, 57% and 25%
for alleles 1, 2 and 3, showing a perfect Mendelian
segregation. Therefore, it is concluded that marker
TGLA116 is genetically linked to Weaver. From the
generated lodscore curves, the genetic distance between
the two loci is estimated at 3 ± 10 centiMorgan. The
limits of this 95% confidence interval correspond to
recombination rates with lodscores one unit below the
obtained maximum lodscore. Because of the tight linkage
between TGLA116 and Weaver, this marker should be linked
to the associated QTL as well. The distance between
TGLA116/Weaver and QTL is, however, unknown at this
point. The effect using Weaver as marker, however, was
of such magnitude that the genetic distance separating
these loci is unlikely to be great. We are in the
process of determining the relative location of these
three loci.

In consequence, the TGLA116 marker will allow us to
perform marker assisted selection against the Weaver
condition. Indeed, it is now possible, for offspring
from individuals heterozygous for both the Weaver
condition and TGLA116, to estimate the genotypic
likelihoods at the Weaver locus based on their TGLA116
genotype and that of their parents.

In addition, we are now in a position to test the effect
of the corresponding chromosomal segment on milk
production.
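The kind of genotypic likelihood referred to above can be sketched in its simplest form. The sketch assumes that the linkage phase in the doubly heterozygous parent is known and that the marker allele the offspring received from that parent can be identified unambiguously; the function name is hypothetical, and the recombination rate of 7.5% is the estimate quoted earlier in the text.

```python
def prob_weaver_allele_transmitted(received_linked_marker_allele, theta=0.075):
    """Probability that the offspring received the Weaver allele from a
    carrier parent, given which marker allele it received from that parent.

    received_linked_marker_allele: True if the offspring received the marker
    allele located on the same parental chromosome as the Weaver allele.
    """
    if not 0.0 <= theta <= 0.5:
        raise ValueError("theta must lie between 0 and 0.5")
    return 1.0 - theta if received_linked_marker_allele else theta


print(prob_weaver_allele_transmitted(True), prob_weaver_allele_transmitted(False))
```

When phase is unknown or the parental origin of the marker alleles is ambiguous, these probabilities have to be averaged over the possible configurations, which is what pedigree likelihood programs do.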

III. VELOGENETICS



Few question the fundamental interest of resolving
quantitative traits into their Mendelian components by
mapping the underlying QTL. The implementation of Marker
Assisted Selection into breeding schemes, however, has
not always been received with a lot of enthusiasm. Part
of this skepticism reflects the disbelief that DNA
Marker Maps will become available for our domestic
species within a reasonable time-span, or that QTL can
be identified by linkage strategies. In our view, these
arguments only reflect the lack of information of their
proponents. On the other hand, part of the skepticism
expresses the doubt that the genetic gains obtainable by
Marker Assisted Selection will justify expensive and
tedious large scale genotyping. Although the costs of
genotyping will drop substantially in the near future,
due to the amazing pace at which automation and robotics
are applied to DNA technology, this objection remains
quite pertinent.

Another major limitation of Marker Assisted Selection
in its present form is its restriction to the
exploitation of genetic variation preexisting within the
commercial breed of interest, and only if present in a
"high merit" genetic background. Favorable mutations
appearing within a mediocre background, or present in
"exotic" germplasm, would be difficult to exploit, even
with markers.

We propose a scheme, "Velogenetics", combining Marker
Assisted Introgression and germ-line manipulations to
reduce the generation interval, which might drastically
increase the power of Marker Assisted Selection.

A. Marker Assisted Introgression 

The basic principles underlying Marker Assisted
Introgression are well known. A gene responsible for a
favorable attribute can be introgressed from a "donor"
strain into a "recipient" strain by repeated
backcrossing. During the introgression process, the
retention of the favorable gene is monitored in the
backcross products with linked, flanking DNA markers.
This latter aspect is particularly important for traits
involving multiple genes and/or characterized by sex- or
age-limited expression. Classical genetic theory tells us
that, with the exception of the "marked" segment whose
retention is desired, the genomic contribution of the
donor line is diluted by half after each backcross.
Hence, after four backcrosses, the recipient genome is
reconstituted to approximately 90% of the original. At
the marked locus, however, the backcross retains one copy
of the desired "donor" variant. If required, one
intercross will then generate 25% of offspring homozygous
for the favorable donor variant. The net result is a
"graft" of an advantageous gene within a recipient
background. The procedure entirely respects the
organization and chromosomal localization of the grafted
gene, avoiding the aberrant expression patterns that too
often characterize transgenes.
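
The halving rule stated above can be written out as a short calculation. The sketch below is illustrative only and is not part of the original disclosure: the function name is hypothetical, and it assumes that the F1 cross contributes 50% donor genome and that each subsequent backcross halves the expected donor share of the unlinked genome. Under that counting convention the expected recipient share after four backcrosses is somewhat above the round figure of about 90% quoted in the text; counting the F1 cross as the first of the four crosses gives a value closer to it.

```python
def expected_recipient_fraction(n_backcrosses: int) -> float:
    """Expected recipient share of the unlinked genome.

    Assumption (not from the source): the F1 carries 50% donor genome and
    every later backcross to the recipient halves the donor share.
    """
    if n_backcrosses < 0:
        raise ValueError("n_backcrosses must be non-negative")
    donor_share = 0.5 ** (n_backcrosses + 1)
    return 1.0 - donor_share


if __name__ == "__main__":
    # n = 0 is the F1 itself; n = 1..4 are successive backcrosses.
    for n in range(5):
        print(n, round(expected_recipient_fraction(n), 4))
```

These are expectations over many offspring; individual animals scatter around them, and the region linked to the selected marker lags behind, as discussed further below.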
Notably, the gene to be transferred does not need to
be cloned per se. Only its genetic map location is
required, as defined by the availability of linked
markers, ideally flanking the gene of interest on each
side. Hence, this procedure is perfectly applicable to
the introgression of QTL identified through the
previously described mapping strategies.

Marker Assisted Introgression can easily be applied
to several genes simultaneously. This feature will be
of particular interest for complex traits involving
several genes. Introgressing more than one gene from a
donor to a recipient line, however, increases the
selection intensity at each backcross: with 1 marker,
1/2 of the offspring have the favorable genotype, with
2, 1/4 and with n markers, (1/2)^n.
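
As a small worked illustration of the (1/2)^n rule, the following sketch computes the favorable fraction and the expected number of backcross offspring that must be genotyped to find one carrying all donor segments. The function names are hypothetical, and the sketch assumes unlinked loci and a backcross parent that is heterozygous at every marked locus.

```python
from fractions import Fraction


def favorable_fraction(n_markers: int) -> Fraction:
    """Fraction of backcross offspring retaining the donor allele at all loci."""
    if n_markers < 0:
        raise ValueError("n_markers must be non-negative")
    return Fraction(1, 2) ** n_markers


def expected_offspring_screened(n_markers: int) -> int:
    """Mean number of offspring genotyped per favorable one (1 / fraction)."""
    return 2 ** n_markers


if __name__ == "__main__":
    for n in range(1, 6):
        print(n, favorable_fraction(n), expected_offspring_screened(n))
```

The number of animals to screen doubles with each added gene, which is one reason the generation interval becomes the limiting factor in a slowly reproducing species.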

Selecting for the retention of defined "donor" 
genes will hamper the recovery of the recipient back- 
ground genotype in adjacent chromosomal regions. This 
can be compensated for by increasing the number of 
backcrosses, or better by monitoring the fate of addi- 
tional adjacent markers to identify the backcross 
products resulting from recombinations as close to the 
"grafted" gene as possible. 

B. Shortening the Generation Interval
of Domestic Species by "Velogenesis"

Introgression by repeated backcrossing, assisted or
not by genetic markers, is common practice in a variety
of organisms, but is essentially unfeasible in domestic
animals such as cattle, because of their prohibitively
long generation time. The generation interval of such
species could, however, be reduced based on the "in
vitro" maturation and fertilization of foetal oocytes,
hereinafter referred to as "Velogenesis".

An overview of female gametogenesis (100, 101)
indicates that the feasibility of such a scheme may not
be that far-fetched. Briefly, oogenesis begins with the
formation of primordial germ cells in the region of the
allantois. These precursor cells migrate to the
developing gonads where, after a period of mitotic
proliferation, they enter meiosis. Meiosis is arrested
at the diplotene stage of prophase I by the poorly
understood "meiotic division I arrest system", after
which the primary oocyte enters a resting phase. During
the lifetime of the animal, small numbers of resting
primary oocytes are successively recruited into a pool
of growing oocytes, within the environment of a
gonadotropin-dependent developing follicle. These
activated oocytes grow in size, acquire the competence
to resume meiosis if appropriately stimulated, and
accumulate the material required to sustain the early
stages of subsequent embryonal development.
Resumption of meiosis and oocyte maturation is triggered
by a hypothetical maturation-inducing signal produced by
granulosa cells in response to gonadotropins. At least
in rodents, oocyte maturation seems to be mediated by a
drop in cyclic AMP in the oocyte and subsequent
inactivation of a type A protein kinase. Evidence for
the role of this pathway in oocyte maturation is,
however, much more controversial in ruminants. Note
that in the granulosa cells, gonadotropins act, among
other pathways, through the activation of adenylate
cyclase with a subsequent increase in cAMP concentrations
(102). In the oocyte, a cascade of still to be
determined events then probably leads to the
phosphorylation and activation of a phosphatase, probably
homologous to the product of the S. pombe cdc25 gene
(103), which will itself dephosphorylate and activate the
M-phase promoting factor (MPF), now known to be a complex
of a p34cdc2 protein kinase subunit with a B type cyclin
(see 104 for a review). The maturing oocyte completes the
first meiotic division and enters the second (becoming a
secondary oocyte), which will be arrested as well, at
metaphase II, until fertilization. This "meiotic
division II arrest system" is thought to reflect the
stabilization of MPF mediated by the kinase activity of
pp39mos on either a cyclin protease or on cyclin itself.
Fertilization relieves this block by increasing the
intracellular Ca2+ concentration, triggering
calcium-dependent protease activity (reviewed in 104).

In cattle, primordial germ cells reach the genital
ridge at about 40 days of gestation. After a period of
mitotic proliferation, they differentiate into oogonia
starting around 60 days of gestation. Mitotic
proliferation of the germ line ceases around day 170 of
gestation, fixing the maximum number of oocytes the
female will ever have. Meiosis starts at about 80 days,
and the first primordial follicles are discernible at 90
days of gestation. Remarkably, activation of resting
primordial follicles starts already in utero, around day
140, and secondary and tertiary follicles can be seen at
210 and 230 days, respectively. It is estimated that 2 to
4 resting primordial follicles are recruited daily into
the pool of activated, developing follicles. These
activated foetal oocytes, however, are irrevocably
committed to follicular atresia. Indeed, spontaneous
oocyte maturation and ovulation do not begin until
puberty. Submitted to appropriate hormonal stimulus,
however, prepubertal oocytes can resume meiosis, can be
fertilized and can produce viable offspring. Indeed,
offspring have been obtained from gonadotropin-stimulated
calf oocytes, transferred to postpubertal recipient
animals (reviewed in 105). The purpose of velogenesis
would be to attempt to obtain similar results with foetal
oocytes at the earliest stage possible, as early as 90 to
180 days of gestation.
Very encouraging is the development in mice of
culture systems supporting the growth of primary
follicles, yielding mature oocytes capable of
fertilization in vitro and development to term (106,
107). It is reasonable to anticipate that similar
conditions, supporting development of bovine oocytes,
will become available in a species where primary oocytes
from relatively small antral follicles can already be
successfully matured and fertilized in vitro.

One way to achieve velogenesis would be to attempt
to rescue oocyte nuclei from primordial follicles by
their transfer into enucleated, maturable oocytes.

So far we have only discussed velogenesis through
the reduction of the female generation interval. "Male"
velogenesis could similarly be accomplished by the early
stimulation of spermatogenesis.

The impact on breeding programs of "velogenesis", or
the reduction of generation time by in vitro maturation
and fertilization of fetal oocytes, has been discussed by
Betteridge et al. (101). In dairy breeding, for
instance, annual responses in milk yield could be
doubled compared to conventional progeny testing. With
the added power of Marker Assisted Introgression, the
approach becomes much more powerful. "Velogenetics", or
the synergistic use of Marker Assisted Introgression and
"velogenesis", can be viewed as a procedure for the
rapid and efficient intraspecies transfer of desirable
genes between genetic backgrounds. By analogy with the
term "transgene", the manipulated genes are referred to
as "velogenes".

In particular, desirable traits identified outside
commercial breeding stock could be quickly introgressed
into high merit genetic backgrounds. Examples would
include disease resistance genes, genes affecting milk
and meat composition, the Polled gene, coat color genes,
etc. Moreover, the possibility to exploit "exotic"
genetic variation identified outside the breed of
interest is particularly attractive because it greatly
facilitates the mapping of the genes of interest.

A schematic representation of "Velogenetics" is 
shown in Figure 3. 

The present invention is further detailed in the
following Examples, which are offered by way of
illustration and are not intended to limit the invention
in any manner. Standard techniques well known in the art
or the techniques specifically described below were
utilized.

EXAMPLE 1

CHARACTERIZATION OF
A SET OF VARIABLE NUMBER OF TANDEM REPEAT MARKERS
CONSERVED IN BOVIDAE.

INTRODUCTION 

Human minisatellite sequences, exhibiting very high
levels of genetic polymorphism due to variation in the
number of tandem repetitions, have proven an invaluable
source of genetic markers commonly termed "VNTRs"
(18-20). VNTRs have been instrumental in the genetic
mapping of several disease-causing genes, as tools for
individual identification and paternity diagnosis, and to
address a variety of biological issues, including
imprinting, loss of heterozygosity in malignancies, etc.

In animal genetics, highly polymorphic markers such
as VNTRs could similarly be used for individual
identification and paternity diagnosis, which rely today
on less informative biochemical polymorphisms and blood
group systems, and for the mapping of so-called
economic trait loci (ETL) or genes involved in the
determination of production traits. Classically,
artificial selection has relied on the biometrical
evaluation of breeding values from individual
performance records and from performances of relatives
(136). One of the powers of the biometrical approach is
that it obviates the need for any detailed molecular
knowledge of the underlying genes or ETL. However, it
is believed that the genetic mapping of ETL could be
used to increase genetic response by affecting accuracy
and speed of selection, through a procedure called
marker assisted selection (MAS) (91, 96). Moreover,
defined alleles could be moved efficiently between
genetic backgrounds by velogenetics, or the combined use
of marker assisted introgression and germline
manipulations aimed at reducing the generation interval
(141, 142).

We report the cloning and characterization of 36
bovine variable number of tandem repeat (VNTR) markers,
characterized by a high degree of polymorphism within
commercial herds and shown to be conserved within
Bovidae.

MATERIALS AND METHODS 

1. Cloning of bovine VNTRs and detection of
polymorphism:

500 µg genomic DNA from 20 unrelated cows was
digested to completion with MboI or HaeIII. After two
rounds of size fractionation by agarose gel
electrophoresis, electroelution and addition of EcoRI
linkers (only for HaeIII restricted DNA), fractions from
3 to 4 Kb (kilobases), from 4 to 6 Kb and above 6 Kb
were ligated into the BAP-dephosphorylated BamHI and
EcoRI sites, respectively, of pUC13. Approximately
80,000 independent clones were obtained by
transformation of DH5α cells, and were screened
successively with the following minisatellite sequences:
the 282 base pair HaeIII-ClaI fragment containing the
minisatellite in the protein III gene of wild-type M13,
pUCJ, pSP64.2.5EI ('Per'), pα3'HVR64, pINS310, EFD134.7
and pS3 (20, 21, 110, 123, 143). Hybridization and
washings were done in the conditions used to generate
multilocus DNA fingerprints with the respective probes
(110, 143). To check for polymorphism, plasmid DNA
isolated from positive colonies was used to probe MboI,
HaeIII and TaqI Southern blots of 18 randomly selected
American Holsteins. Hybridizations were done at 65°C in
7% SDS, 10% PEG, 50 mM NaHPO4 with addition of 50 µg/ml
bovine genomic DNA. Final washes were at 65°C in
0.1x SSC, 0.1% SDS. When using bovine probes on ovine
Southern blots, hybridization and washing temperatures
were reduced by 10°C.

2. Estimation of Matching Probabilities and
Exclusion Powers:

Allelic frequencies were estimated from the sample
of 18 randomly selected American Holsteins. Matching
probabilities and exclusion powers (113) were then
estimated by Monte-Carlo simulation (10,000 simulations
in each case), assuming Hardy-Weinberg equilibrium and
using "Pat-Power", a program designed by one of us. The
following parameters were estimated: MPR: matching
probability for two randomly selected individuals; MPS:
matching probability for full-sibs; EPR: probability to
exclude an alleged father unrelated to the real one
(mother's phenotype known); EPS: probability to exclude
an alleged father full-sib to the real one (mother's
phenotype known); EP1: probability to exclude a wrongly
assigned parent without phenotypic information available
from the other one.



Patpower calculates Matching Probabilities and
Exclusion Powers characterizing given autosomal
polymorphic systems, by Monte-Carlo simulation.

Matching Probabilities relate to individual
identification and express the likelihood that two
individuals would have the same pattern with a given
probe. Patpower calculates two types of Matching
Probabilities: MPR, the Matching Probability for two
unrelated individuals, and MPS, the Matching Probability
for two full-sibs.

Exclusion Powers relate to paternity diagnosis and
express the likelihood that a wrongly assigned paternity
or maternity will be detected with a given probe.
Patpower determines three types of Exclusion Power: EPR,
where one parent is known with certainty and the proband
is unrelated to the other real parent; EPS, where one
parent is known with certainty and the proband is a
full-sib of the other real parent; and EP1, where only
the proband is available.

The user needs to input the number of alleles
characterizing the polymorphic system in the population
of interest, their respective frequencies, and their
dominance-recessivity relationships. For the ABO blood
group system in humans, for instance, A and B are
codominant and both dominate O. Each allele is given a
binary code following the rules of the "LINKAGE" program
(60).

Patpower then stochastically generates a pair of
parents with an offspring, a full-sib of the real father
and an unrelated individual. "Phenotypes" are obtained
from the genotype using the boolean "or" operator and
are used to determine matching between unrelated
individuals (MPR) and between full-sibs (MPS), as well as
the exclusion of the unrelated individual considered as a
proband, with (EPR) and without (EP1) information from
one of the real parents, and exclusion of the uncle
considering information from the real mother (EPS).
This simulation is repeated as many times as determined
by the user, allowing for the estimation of the
respective likelihoods.
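
The simulation loop described above can be sketched in a few lines. This is a minimal illustration written for this text and is not the Patpower program itself: all function names are hypothetical, alleles are assumed to be fully codominant (so the phenotype equals the genotype and the binary dominance coding is omitted), genotypes are drawn under Hardy-Weinberg equilibrium, and the real father and his full-sib are derived from a simulated pair of grandparents.

```python
import random


def draw_genotype(freqs, rng):
    """Draw one genotype (sorted pair of allele indices) under Hardy-Weinberg."""
    alleles = range(len(freqs))
    return tuple(sorted(rng.choices(alleles, weights=freqs, k=2)))


def child_of(parent_a, parent_b, rng):
    """Mendelian offspring: one randomly chosen allele from each parent."""
    return tuple(sorted((rng.choice(parent_a), rng.choice(parent_b))))


def excluded_with_mother(child, mother, alleged):
    """True if no assignment of maternal/paternal alleles fits the alleged sire."""
    a, b = child
    compatible = (a in mother and b in alleged) or (b in mother and a in alleged)
    return not compatible


def excluded_alone(child, alleged):
    """True if the alleged parent shares no allele with the offspring."""
    return not (child[0] in alleged or child[1] in alleged)


def simulate(freqs, n_sims=10000, seed=0):
    """Estimate MPR, MPS, EPR, EPS and EP1 for one codominant marker."""
    rng = random.Random(seed)
    counts = {"MPR": 0, "MPS": 0, "EPR": 0, "EPS": 0, "EP1": 0}
    for _ in range(n_sims):
        grandsire, granddam = draw_genotype(freqs, rng), draw_genotype(freqs, rng)
        father = child_of(grandsire, granddam, rng)
        uncle = child_of(grandsire, granddam, rng)   # full-sib of the real father
        mother = draw_genotype(freqs, rng)
        unrelated = draw_genotype(freqs, rng)
        child = child_of(mother, father, rng)
        counts["MPR"] += father == unrelated
        counts["MPS"] += father == uncle
        counts["EPR"] += excluded_with_mother(child, mother, unrelated)
        counts["EPS"] += excluded_with_mother(child, mother, uncle)
        counts["EP1"] += excluded_alone(child, unrelated)
    return {name: hits / n_sims for name, hits in counts.items()}


if __name__ == "__main__":
    # Hypothetical five-allele system with equal frequencies.
    print(simulate([0.2] * 5, n_sims=10000, seed=42))
```

Handling "open" systems with a pooled recessive class would require mapping genotypes to phenotypes before each comparison, which this sketch deliberately leaves out.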

3. Segregation and linkage analysis:

All members of an American Holstein pedigree with
91 offspring, obtained by multiple ovulation and embryo
transfer (MOET) from 20 parents, were genotyped for all
identified markers. Each parent has a mean of 9.1
offspring with a mean of 1.9 partners. Segregation and
linkage analysis were done with slightly modified
versions of the "LINKAGE" programs as previously
described (31).

4. Synteny mapping:

The hybrid somatic cells were prepared by fusion as
previously described (97). Southern blot hybridization
and concordancy analysis were done according to
Threadgill et al. (114).

5. In situ hybridization:

Chromosomes were prepared as described by Fries et
al. (115) and chromosome identification was based on
QFQ-banding and according to the international standard
(116). Probe preparation and in situ hybridization were
as previously described (144).

RESULTS 

1. A set of bovine VNTR markers:

Using the strategy described above, we have
isolated a total of 36 bovine VNTRs, listed in Table 1.
Polymorphic patterns were attributed to minisatellite
sequences when characterized by more than two alleles
distinguishable with more than one restriction enzyme.
Seven additional, non VNTR-type polymorphisms were
detected during this experiment and are reported as
well.
TABLE 1
VNTR Clones

Name       Locus^1    Polymorph^2  Enz^3          Het^4  Ovin^5

GMBT-002   DU16S1b    VNTR         HaeIII         52     XT
GMBT-003              VNTR         HaeIII         56
GMBT-005   D24S1b     VNTR         HaeIII         85
GMBT-006   D14S1b     VNTR         HaeIII         73     VT £4
GMBT-007   DU10S1b    VNTR         HaeIII         96     F
GMBT-008              VNTR         TaqI                  f
GMBT-009   DU22S1b    VNTR         HaeIII         58
GMBT-011   D26S1b     VNTR         HaeIII         85     T5 XT
GMBT-012              VNTR         MboI           22
GMBT-013              VNTR         HaeIII         4
GMBT-015   D21S3b     VNTR         HaeIII         61     F
GMBT-016   D21S12b    VNTR         HaeIII         78     F
GMBT-017   D8S1b      VNTR         HaeIII         15     mr
GMBT-019   D10S1b     VNTR         MboI           7
GMBT-020              VNTR         MboI           oD
GMBT-021   D21S2b     VNTR         HaeIII         OD
GMBT-022   D19S1b     VNTR         MboI           4U
GMBT-025              VNTR         HaeIII
GMBT-026              VNTR         HaeIII         ZD
GMBT-027              VNTR         JIPOX
GMBT-028   D2S1b      VNTR
GMBT-031              VNTR         HaeIII         58
GMBT-033              VNTR                        70
GMBT-034              VNTR         HaeIII         20
GMBT-035              VNTR         HaeIII         59
GMBT-036   DU27S1b    VNTR                        89
GMBT-039              VNTR         HaeIII         33
GMBT-041   D23S1b     VNTR         HaeIII         81
GMBT-042              VNTR         HaeIII         78
GMBT-047   D2S2b      VNTR         HaeIII         65
GMBT-049              VNTR+PM      HaeIII, MboI
vnox vox                           WaoTTT         OA
GMBT-053              VNTR         HaeIII         59
GMBT-058              VNTR         TaqI           89
GMBT-059   DU10S2b    VNTR         QaipHI         67
GMBT-060              VNTR+PM      MspI           87
GMBT-002              PM           TaqI           37
GMBT-024              PM           TaqI           62
GMBT-029              PM           Ifflafil       28
GMBT-014   DU22S2b    (?)          MboI, TaqI     68
GMBT-018              (?)          TaqI           17

1 LOCUS: locus name following HGM nomenclature rules
whenever available from mapping studies.

2 POLYMORPH: type of polymorphism (VNTR: Variable
Number of Tandem Repeats; PM: Point Mutation; (?):
unexplained).

3 ENZ: preferred restriction enzyme for its detection.

4 HET: heterozygosity within Holsteins, estimated
from a sample of 27 presumably unrelated Holstein
animals.

5 OVIN: cross-reaction in sheep; N, negative; M,
monomorphic; P, polymorphic, not tested.
Within the American Holstein breed, the mean
heterozygosity over all VNTR systems was 59.3%. When
using probe GMBT-016 with MboI instead of HaeIII, and
supposedly because of the presence of minisatellite
variant repeats (MVR) (39) harboring MboI sites, an
extremely variable, locus-specific "midisatellite"
pattern (36) is generated (data not shown). Used with
MboI, this probe is particularly powerful for individual
identification.

We found one clear instance of maternal neomutation with
probe GMBT-022. Besides this, all probes showed proper
Mendelian segregation.

Table 2 reports estimated matching probabilities
and exclusion powers as well. Systems GMBT-009,
GMBT-011 and GMBT-022 were treated as "open" systems,
meaning that, because of their small size, some alleles
were not detectable in our conditions. To avoid
ambiguities in identification and paternity diagnosis,
these unidentified alleles were pooled in a single
"recessive" class. For individuals showing a single band,
no distinction was made between homozygosity and
heterozygosity based on band intensity.

Discrepancies between probe ranking according to
heterozygosity versus ranking according to matching
probabilities and exclusion power most probably result
from the small sample size used to estimate both types
of parameters. However, heterozygote advantage at some
loci could be an alternative, although unlikely,
explanation in view of the apparent neutral behaviour of
human minisatellite sequences (124).
TABLE 2
Matching Probabilities for VNTR Clones
85.79


92.78 


03.59 


01.86 


00.27 



MPR is Matching Probability for two randomly
selected individuals.

MPS is Matching Probability for two full-sibs.

EPR is Exclusion Power when putative father is
unrelated to real father.

EPS is Exclusion Power when putative father and
real father are full-sibs.

EP1 (or EPSP) is Exclusion Power when only one
parent is available.
2. Genomic distribution:

We performed pair-wise linkage analysis between all
markers. As it is known that at least some of these
sequences are organized as minisatellite clusters (31,
35, 145), we were expecting to find tightly linked
systems. We found evidence for five pairs of linked
markers, of which four were characterized by a
recombination rate below 5% (Table 3). However,
two of the systems involved in a tight linkage detect
non VNTR-type polymorphisms (GMBT-014, GMBT-029). The
corresponding probes were probably isolated because they
contain a genuine although non-polymorphic
minisatellite, and were fortuitously detecting other
types of polymorphism. Despite these five linked pairs,
results of the linkage analysis pointed towards a
scattering of these markers throughout the bovine
genome.



TABLE 3

Linked Systems             θ^1      lodscore^2

GMBT-003 and GMBT-029      0.0%     5.00
GMBT-007 and GMBT-059      11.3%    9.11
GMBT-009 and GMBT-014      4.8%     3.74
GMBT-015 and GMBT-016      3.7%     27.00
GMBT-028 and GMBT-047      2.5%     9.40

1 θ = recombination rate.

2 pair-wise lodscores were calculated
with the "LINKAGE" programs.
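
To make the lodscore column easier to read, the following sketch shows the simplest form of a two-point lod score. It is an illustration under strong assumptions and does not reproduce the "LINKAGE" computation: the function names are hypothetical, and every meiosis is taken to be fully informative and phase-known, so that recombinants can be counted directly.

```python
import math


def two_point_lod(recombinants: int, meioses: int, theta: float) -> float:
    """log10 likelihood ratio of linkage at theta versus free recombination."""
    if not 0 <= recombinants <= meioses:
        raise ValueError("recombinants must lie between 0 and meioses")
    if not 0.0 <= theta <= 0.5:
        raise ValueError("theta must lie between 0 and 0.5")
    if theta == 0.0 and recombinants > 0:
        return float("-inf")  # any recombinant rules out complete linkage
    log_linked = (meioses - recombinants) * math.log10(1.0 - theta)
    if recombinants:
        log_linked += recombinants * math.log10(theta)
    log_unlinked = meioses * math.log10(0.5)
    return log_linked - log_unlinked


def max_lod(recombinants: int, meioses: int):
    """Maximum-likelihood theta (capped at 0.5) and the lod score at that value."""
    theta_hat = min(recombinants / meioses, 0.5)
    return theta_hat, two_point_lod(recombinants, meioses, theta_hat)


if __name__ == "__main__":
    # Hypothetical counts, not taken from the pedigree described in the text.
    print(max_lod(0, 20))
    print(max_lod(2, 40))
```

With no recombinants each informative meiosis adds log10(2), about 0.3, to the lod score; real pedigree data with unknown phase and partially informative matings yield less per meiosis.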

Reference markers for the respective synteny groups
were U1:GNB1, U2:KE1, U3:NKNB, U4:MPI, U5:FOS, U6:AMY1,
U7:LDHA, U8:GNB2, U9:GPI, U10:SOD1, U11:VIM, U12:GPX1,
U13:MET, U14:GSR, U15:CASK, U16:ABL, U17:CRYG,
U18:GGTB2, U19:CAT, U20:GLO1, U21:GH, U22:AMH,
U23:ALDH2, U24:TG, U25:CLTIA, U26:OAT, U27:DU27S1b,
U28:MBP, U29:RBP3 and X:DMD. Synteny groups with highest
concordancy scores, to which corresponding VNTRs were
assigned, are underlined.
Evidence for a broad genomic distribution of our
VNTRs was supported by the tentative assignment of 13 of
them to 11 different synteny groups using somatic cell
hybrids (Table 4). GMBT-036 identifies a previously
unmarked bovine synteny group. Probe GMBT-021 was
assigned to the same synteny group as probes GMBT-015
and -016. Although the latter two probes were shown to
be tightly linked, linkage between those probes and the
former one could be excluded for recombination rates <
15%.
Eight VNTRs were mapped by "in situ" hybridization
as well: GMBT-006 to 14q11-16, GMBT-005 to
24q13.3-22, GMBT-011 to 26q11-21, GMBT-015 and GMBT-016
to 21q22-24, GMBT-019 to 10q14-23, GMBT-022 to 19q21-23,
GMBT-028 to 2q13-21. Again good genomic coverage was
evident, since six probes mapped to five different
chromosomes. Probes GMBT-015 and -016 both mapped to
21q23-24, as expected from the linkage study and the
assignment on the hybrid panel. Surprisingly, five out
of the eight VNTRs clearly showed an interstitial map
location. Only probes GMBT-015, -016 and -022 were
located preterminally, the former two identifying the
same minisatellite cluster. These results seem to
contrast with those of Royle et al. (35), which
demonstrated preferential preterminal mapping of human
VNTRs. Probe GMBT-011, previously located on U26, was
mapped to chromosome 26, allowing us to tentatively
assign synteny group U26 to chromosome 26.

3. Conservation of sequence and map location within
Bovidae:

We hybridized ten bovine VNTRs to sheep Southern
blots, under slightly reduced stringency conditions.
Seven of them yielded locus-specific patterns, of
which six showed a substantial degree of polymorphism
in a sample of 5 unrelated sheep (Table 1).

Probes GMBT-016, -019 and -022, mapping in the
bovine to 21q23-qter, 10q15-q24 and 19q21-qter
respectively, were mapped by in situ hybridization in
sheep as well. The three probes produced signals on
chromosomes 18, 7 and 11, recognized as evolutionary
homologues of bovine chromosomes 21, 10 and 19 (116).
Moreover, the signals were found over the exact positions
expected in case of conservation of chromosomal location
in cattle and sheep.
DISCUSSION



To isolate bovine VNTRs, we have used a strategy
similar to Wong et al. (146, 147), based on the
screening of size-selected restriction fragments
obtained by complete digestion with the four-cutters
MboI and HaeIII. Advantages of this strategy are: (1)
the complexity of this size-range is substantially
reduced; following Bishop et al. (128) and assuming an
exponential distribution of restriction fragment
lengths, the fragments > 2 Kb represent about 10*^ of the
total number of MboI or HaeIII fragments, corresponding
to approximately 10* fragments; this allows us to work
readily with plasmid vectors; (2) the subsequent search
for and use of the polymorphism is performed with the
same enzyme used to generate the libraries, obviating
the need to screen several restriction enzymes, hence
reducing costs; (3) relying on frequent four-cutters,
the cloned minisatellites contain very little flanking
sequence and only very few of them carry highly
repeated sequences which would interfere during
hybridization; (4) theoretically, the larger
minisatellites targeted by this approach are more likely
to be involved in mutational events and could therefore
be the more polymorphic ones.
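
The size-selection argument in point (1) can be illustrated numerically. The sketch below is only an order-of-magnitude illustration and is not taken from Bishop et al.: the function names are hypothetical, the mean fragment length of 256 bp assumes a four-base recognition site in random sequence with equal base frequencies, and the genome size of 3 x 10^9 bp is an assumed round figure.

```python
import math


def fraction_longer_than(min_len_bp: float, mean_len_bp: float = 256.0) -> float:
    """Share of fragments exceeding min_len_bp under an exponential length model."""
    return math.exp(-min_len_bp / mean_len_bp)


def expected_fragments(genome_bp: float, min_len_bp: float,
                       mean_len_bp: float = 256.0) -> float:
    """Expected count of fragments longer than min_len_bp in one haploid genome."""
    total_fragments = genome_bp / mean_len_bp
    return total_fragments * fraction_longer_than(min_len_bp, mean_len_bp)


if __name__ == "__main__":
    # Assumed haploid genome size; the 2 Kb threshold is the one used in the text.
    print(fraction_longer_than(2000))
    print(expected_fragments(3e9, 2000))
```

Real genomes deviate from this model (base composition, CpG depletion and repeats all shift the mean fragment length), so the result indicates scale rather than a precise count.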
A disadvantage of this approach is the unequal
representation of minisatellite loci in our library.
The libraries were generated with a mixture of DNA from
20 unrelated individuals, to increase the number of
clonable microsatellites. As a consequence, loci for
which most alleles are within the selected size range
will be overrepresented, compared to loci for which the
majority of alleles in the population are below this
range.

This collection of bovine VNTRs could be used for
DNA based individual identification and paternity
diagnosis. Combining our top 5 probes, matching
probabilities and exclusion powers at least as good as
those obtained with classical systems are obtained: MPR:
8x10"®, MPS: 4x10"^, EPR: 0.999, EPS: 0.893, EP1: 0.987.
Adding more probes will of course only increase the
power of the system. As a matter of fact, these probes
have been used efficiently to solve paternity problems
beyond the power of blood group systems. DNA typing is
not limited to blood samples as present systems are,
which expands its spectrum of applications and power.
As an example, DNA typing has been used to deal
efficiently with fetal blood cell chimerism (127),
frequently encountered in cattle. Compared with
multilocus DNA fingerprints, locus-specific VNTRs are
much easier to interpret and are more reproducible.
Following properly established standardization
procedures, a "common language" could be established
allowing exchange of information between laboratories.
It is noteworthy that heterozygosity and allelic
frequencies for some probes seem to vary substantially
between breeds. As an example, probe GMBT-012 is
characterized by a heterozygosity of 22% in Holsteins,
but higher than 50% in both Herefords and Brown Swiss.
Hence, proper use of these probes may initially require
accurate estimation of genetic variation for different
breeds.
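
The gain obtained by combining probes follows from simple probability rules, sketched below. The function names and the per-probe values are hypothetical, and the calculation assumes that the probes behave independently, which would not hold for tightly linked systems such as GMBT-015 and GMBT-016.

```python
from math import prod


def combined_matching_probability(per_probe):
    """Probability that two individuals match at every probe (independence assumed)."""
    return prod(per_probe)


def combined_exclusion_power(per_probe):
    """Probability that at least one probe excludes a wrongly assigned parent."""
    return 1.0 - prod(1.0 - power for power in per_probe)


if __name__ == "__main__":
    # Hypothetical per-probe values, chosen only to show the arithmetic.
    matching = [0.05, 0.08, 0.10, 0.12, 0.15]
    exclusion = [0.70, 0.65, 0.60, 0.55, 0.50]
    print(combined_matching_probability(matching))
    print(combined_exclusion_power(exclusion))
```

Matching probabilities multiply, so each added probe shrinks them, while the combined exclusion power approaches 1 as probes are added.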

Assuming a coverage of 20 cM per marker in linkage
studies, the set of markers described in this paper
would allow the scanning of approximately 7 Morgans.
Accepting a total map length for the bovine genome of 25
Morgans (148), this represents close to 33%. We have
complemented the set of bovine VNTRs described in this
paper with over 80 multisite haplotypes, generated with
cosmid probes, and more than 100 microsatellite systems
(31, 148, 149). Therefore, the majority of the bovine
genome is now amenable to linkage scanning. Since
several of these markers are already "anchored" to
specific chromosomes or synteny groups, a primary bovine
DNA marker map should soon be available. Moreover, the
remarkable conservation of mini- and microsatellites
within Bovidae will substantially accelerate the
construction of genetic maps in sheep and goats and
offer the possibility to address interesting
evolutionary issues.

EXAMPLE 2 

GENERATION OF BOVINE MULTISITE HAPLOTYPES
USING RANDOM COSMID CLONES.



INTRODUCTION

The possibility to generate nearly unlimited
numbers of genetic markers through the study of DNA
Sequence Polymorphism (DSP) (76) has revolutionized
human genetics: genetic markers have been used to map
genes involved in a variety of human diseases, which has
direct implications for genetic counselling strategies
and is a first step towards their subsequent cloning by
reverse genetics; they are revolutionizing individual
identification and the examination of familial
relationships; and they are invaluable tools in the
study of a wide variety of biological issues. In
particular, they are expected to play a key role in the
ongoing efforts to entirely map and sequence the human
genome.

For breeders of domestic animal species, the
availability of large numbers of genetic markers means,
besides new approaches for individual identification and
paternity diagnosis, the possibility to map and study
genes determining production traits, and to use this
information in marker assisted selection and
velogenetics (91, 96, 141, 150).

Particularly challenging is the fact that the
majority of production traits are complex,
multifactorial traits. Animal breeders, however, have
the advantage that phenotypic information has been
carefully recorded for thousands of animals over the
years for use in classical biometrical breeding
programs, and that they can, if necessary, design and
generate the ideal family material required for such
mapping studies.

In both the human and animal fields, polymorphic
markers characterized by the highest possible
heterozygosity or "Polymorphism Information Content"
(PIC) (76) are paramount. Hence, the focus has changed
from the original diallelic Restriction Fragment Length
Polymorphisms (RFLPs) to more informative systems based
on the study of sequences such as minisatellites (18,
20), and more recently microsatellites (32-34) and the
polydeoxyadenylate tract of SINE-repetitive elements
(37). Minisatellite sequences in particular have proven
very powerful. They seem to suffer, however, from a
non-random genomic distribution, especially in the human
where, in addition, they show preterminal confinement
(35). Microsatellites, although very abundant and
highly polymorphic, require prior sequencing efforts to
generate the primers needed for their in vitro
amplification. Moreover, the large scale use of
microsatellites requires the development of more
efficient multiplex amplification and data collection
schemes.
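To make the two informativeness measures named above concrete, the
following minimal sketch computes expected heterozygosity and PIC from
allele frequencies. It assumes that the PIC of reference (76) is the
commonly used definition (heterozygosity minus the sum over allele
pairs of 2 p_i^2 p_j^2); the function names are illustrative and do not
come from the original text.

```python
def heterozygosity(freqs):
    """Expected heterozygosity: 1 minus the sum of squared allele frequencies."""
    return 1.0 - sum(p * p for p in freqs)


def pic(freqs):
    """Polymorphism Information Content (assumed standard definition).

    PIC = heterozygosity - sum over allele pairs i<j of 2 * p_i^2 * p_j^2.
    """
    n = len(freqs)
    correction = sum(
        2.0 * freqs[i] ** 2 * freqs[j] ** 2
        for i in range(n)
        for j in range(i + 1, n)
    )
    return heterozygosity(freqs) - correction


# A diallelic RFLP can never exceed H = 0.5, whereas a system with four
# equally frequent alleles (or haplotypes) is markedly more informative.
print(heterozygosity([0.5, 0.5]), pic([0.5, 0.5]))
print(heterozygosity([0.25] * 4), pic([0.25] * 4))
```

The comparison illustrates why the focus moved from single diallelic
RFLPs to polyallelic systems.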

An alternative strategy for the generation of
highly informative marker systems is to combine several
closely spaced diallelic RFLPs into more informative
polyallelic multisite haplotypes (98). We have explored
the use of random bovine cosmid clones in Southern blot
hybridizations in order to identify such sets of closely
spaced DNA Sequence Polymorphisms. Because of the
population structure imposed by breeding strategies,
effective population sizes of domestic species are
expected to be reduced compared to humans. It was
interesting, therefore, to check how far this would
decrease the observed level of genetic variation and
how far the expected concomitant increase in linkage
disequilibrium would affect the efficiency of the chosen
approach in domestic animal populations.
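The gain expected from combining diallelic sites, and the way linkage
disequilibrium erodes it, can be sketched as follows. The sketch
assumes sites that are either in complete linkage equilibrium or in
complete disequilibrium; both cases are idealisations chosen for
illustration, and the function names are hypothetical.

```python
from itertools import product


def equilibrium_haplotypes(site_freqs):
    """Haplotype frequencies for diallelic sites in linkage equilibrium.

    site_freqs holds the frequency of the 'site present' allele at each site.
    """
    haplotypes = {}
    for states in product((0, 1), repeat=len(site_freqs)):
        freq = 1.0
        for state, p in zip(states, site_freqs):
            freq *= p if state else 1.0 - p
        haplotypes[states] = freq
    return haplotypes


def haplotype_heterozygosity(haplotypes):
    """Probability that two randomly drawn haplotypes differ."""
    return 1.0 - sum(f * f for f in haplotypes.values())


# Three independent RFLPs with allele frequencies of 0.5 each.
independent = equilibrium_haplotypes([0.5, 0.5, 0.5])
# The same three sites in complete disequilibrium: only two haplotypes exist.
linked = {(0, 0, 0): 0.5, (1, 1, 1): 0.5}

print(haplotype_heterozygosity(independent))
print(haplotype_heterozygosity(linked))
```

Under complete disequilibrium the three sites carry no more information
than a single diallelic RFLP, which is the concern raised above for
populations with a reduced effective size.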

MATERIALS AND METHODS 

1. Preparation of cosmid clones:

Bovine genomic DNA was prepared using standard
procedures, partially digested with MboI, and size
fractionated by rate zonal centrifugation in a 10%-40%
sucrose gradient. Size fractions around 40 Kb were
ligated into the XhoI site of the cosmid vector pWEC
(pWE15 vector (Stratagene) with pUC18 polylinker -
Erica Cumlin, personal communication), after partial
fill-in of the insert and vector sticky ends with
respectively dATP, dGTP and dCTP, dTTP. The obtained
constructs were packaged using Gigapack II Gold extracts
(Stratagene) and used to infect E. coli 490A hosts (gift
from R. White, University of Utah, Salt Lake City, Utah,
USA). 110 colonies were selected at random, and cosmid
DNA was prepared using standard procedures and purified
by CsCl/ethidium bromide isopycnic centrifugation.

2. Southern blot hybridization:

Genomic DNA from 9 unrelated Holstein individuals
was prepared from venous blood using standard procedures
and digested with 5 U/μg of the following enzymes in the
presence of 4 mM spermidine: BamHI, BglI, BglII, EcoRI,
EcoRV, HindIII, KpnI, MspI, PstI, PvuII, TaqI and XbaI.
4 μg DNA per individual was separated according to size
by agarose gel electrophoresis and blotted onto Pall
Biodyne B membranes using 0.4 M NaOH as transfer buffer.
Membranes were prehybridized at 65°C for 4 hours in 10%
PEG, 7% SDS, 50 mM NaHPO4 (pH 7.2) in the presence of
350 μg/ml bovine genomic DNA. Cosmid DNA was labelled by
random-priming (111) to specific activities of .5*10^
cpm/μg, prehybridized with bovine genomic DNA (5 mg/ml)
for 90 min. at 68°C (112), and added to the
prehybridized membranes for 16 hours. Final washes were
in 0.1X SSC, 0.1% SDS at 65°C. Autoradiography was
carried out for 2 to 6 days at -80°C with Kodak XAR-5
film and intensifying screens (DuPont Cronex Lightning-
Plus). Membranes were stripped by boiling in 0.1% SDS
and reused at least 10 successive times.

3. Calculation of nucleotide diversities:

Nucleotide diversities, π, corresponding to the
average heterozygosity per nucleotide site, were
estimated following Ewens et al. (130), using:

$$\pi = \frac{\sum_i k_i}{2 \sum_i r_i m_i \ln n_i}$$

where $n_i$ stands for the number of chromosomes studied
with the i-th enzyme, $r_i$ for the number of bp of the
recognition sequence of the i-th enzyme, and $m_i$ for the
number of cleavage sites explored with that enzyme, of
which $k_i$ are polymorphic.

The number of explored restriction sites was
estimated from the number of fragments f observed by
Southern blotting, using m = (3f+1)/2 (119).
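The calculation described in this section can be sketched in a few
lines. The sketch assumes that the estimator has the form
π = Σ k_i / (2 Σ r_i m_i ln n_i), with the symbols defined above, and
uses m = (3f+1)/2 to convert fragment counts into explored sites. The
function names and the input values are hypothetical and are not data
from this study.

```python
import math


def explored_sites(fragments):
    """Number of restriction sites explored, from the fragment count f."""
    return (3 * fragments + 1) / 2


def nucleotide_diversity(enzymes):
    """Estimate pi from per-enzyme tuples (k, r, f, n).

    k: polymorphic sites, r: recognition sequence length in bp,
    f: fragments observed, n: chromosomes studied.
    """
    numerator = sum(k for k, _, _, _ in enzymes)
    denominator = 2.0 * sum(
        r * explored_sites(f) * math.log(n) for _, r, f, n in enzymes
    )
    return numerator / denominator


# Hypothetical input: a six-cutter and a four-cutter scored on
# 9 diploid individuals (18 chromosomes).
example = [(3, 6, 400, 18), (2, 4, 150, 18)]
print(nucleotide_diversity(example))
```

Pooling the sums over enzymes before dividing, rather than averaging
per-enzyme ratios, weights each enzyme by the number of base pairs it
actually surveys.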






RESULTS

110 randomly selected bovine cosmid clones were
used in Southern blot hybridization experiments as
described in Materials and Methods. 96 of them, or
87.2%, gave usable patterns and were kept for further
analysis. Combining data from the 12 restriction
enzymes used, a mean of 53.87 fragments per cosmid
qualified as unambiguously readable. The only RFLPs
considered in this paper are the ones affecting these
selected fragments.

Assuming a cosmid insert size of 40 Kb and
estimating the mean restriction fragment length in Kb
(L) for a restriction enzyme according to Bishop et al.
(128), the expected number of restriction fragments
detected in Southern blot hybridization per cosmid probe
for a given enzyme can be approximated by:
integer(40/L)+1. For our 12 enzymes, we therefore expect
a total of 173 fragments per cosmid probe.
Therefore, the 53.87 fragments actually observed per
clone represent only 31%, or less than 1/3, of what is
theoretically possible. The remaining 69% are missed
either because they were considered difficult to read,
or more often because they went undetected due to their
abundance in highly repetitive elements blocked by the
competitor DNA, or due to their size, too small for
efficient detection in our conditions of Southern blot
hybridization. The smallest fragments qualifying as
readable in this study were in the 1 Kb size range.
The latter factor is particularly apparent with the two
four-cutters used, MspI and TaqI, whose expected mean
fragment lengths are the lowest (1747 bp and 1179 bp
respectively) despite the presence of the rare CpG
dinucleotide. Only about 15% of the expected number of
fragments are detected for these two enzymes.
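As a worked illustration of the approximation integer(40/L)+1, the
sketch below applies it to the two mean fragment lengths quoted above
and recomputes the detected fraction from the reported totals. The
function name is illustrative; mean fragment lengths for the other ten
enzymes are not given in the text and are therefore not included.

```python
def expected_fragments(mean_fragment_kb, insert_kb=40.0):
    """Expected number of fragments detected by one cosmid probe."""
    return int(insert_kb / mean_fragment_kb) + 1


# Mean fragment lengths quoted in the text for the two four-cutters.
msp1 = expected_fragments(1.747)
taq1 = expected_fragments(1.179)
print(msp1, taq1)

# Fraction of the theoretical total actually scored per cosmid.
detected_fraction = 53.87 / 173
print(round(100 * detected_fraction))
```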

Nevertheless, as many as 82 of these 96 cosmids, or
85%, showed at least one polymorphism within our
sample of 9 randomly selected individuals. The detected
polymorphic events are classified into two groups: 1)
Point Mutations ("PM"), whenever a defined polymorphic
pattern is only seen with a single enzyme, and 2)
Insertions-Deletions ("ID"), whenever such a pattern is
seen with two or more enzymes. Following these rules,
we identified 215 polymorphic events, or a mean of 2.6
independent RFLPs per cosmid probe. 162 of these
(75.3%) were considered of the PM type, the remaining 53
(24.7%) of the ID type. Table 5 summarizes these
results.
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TABLE 5 



Bovine Multisite Haplotypes 



NAME BamHI BglI BglII EcoRI EcoRV



W3BT002 



lldiil) 
?/3 



HSBT005 2PH{5) 

3P«C5J 
4.4/7,8 



I ID (17) 

4.2i2.9/3. 9+3.2 



1ID(171 

9.5+1.5/9. 1+L6 



KSBT0C7 IIDCl?) 
3/14 



KSBT009 



HSBTDOA 



1PHC23). 
?/5.B 



"5BT011 315(39) 
7,8/5 



1P«(22) 
7.7/B 
2PR(39) 
5.5/4 



110(221 
9.5/10 
2P«(A) 
?/3.7 



KSBT015 1ID[44) 
14/13 



^«(231 
25/22 



1ID{44) 
fi.5/6.5 



HS6T016 



HSBT017 
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TABLE 5 (Continued) 



NAME HindIII KpnI MspI PstI PvuII



HSBT019 


??(??) 




1PK(22) 
20/17+2. B 
2PKU7) 
30+5.5/36 


HSBT02C 






IPX(ll) 
4.3/5.5 


RSBTOOl 21D(55) 
9/12.5 




lID(ll) 
12/12.5 


??(??) 


NSBT0C5 




lIBd?) 


iimi) nmi) 

5.8+5/5.4+4.8 

4PK{50) 

4/7.B 


HSBT007 1ID(17) 
12/6. B 




lIDil?) 
8.2/9.5 




H5BT009 




iPK(44) 
3.9/5.7 


2PH(46) 
4.5+5.1/9.5 


HSBTOOA 




2PK(28] 

5/6.5 

3PH{11) 

3.8/2.e 

4P«£5I 

11/9.3+l.G 




«SBTOn 4PH(22I 
16/14 


MSBT013 


1ID(22) 


??{??) 





19716 
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TABLE 5 (Continued) 
NAME HindIII KpnI MspI PstI PvuII



KSBT015 



??(??! 



ltSBT0l6 



H5BT017 



1ID(40) 
5.B+4.3/6.5 



K5BT0I9 



3PM(50) 
lI/&.4+4,3 



MSST020 2PM (22) 
15/10 
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TABLE 5 (Continued) 



NAME TaqI XbaI HET.


nSBTOOl ??(??) 


55 


HSBTOOl 


0 


MSBT005 lID(17i 


54 


4.2+2.2/3.9*2 




WSBT005 3PK(5) 


C 


14/6 




HSBT007 


17 


M3BT009 


77 


MSBTOOft 




SO 


HSBTOOfi 


0 


ftSD I VVr 


ft 


KSBTOll 3ID(39) 


hi 


5. 3+2 A 




HSETOll 


0 


t III /in n 

4.4+4.6+2,9 




HSBT013 lIDt22) 


33 






HSBT013 


0 


HSBT015 110(44) 


66 


12/9 




MS3T015 3PHE33) 


0 


6.3/10.8 




HS6T016 1ID(40) 


50 






M5BT0I7 


0 


HSBT019 4P«(44) 


89 


M5BT0i9 SPH(36) 


0 


KSBT020 3PK(35t 


66 


30/25 
TABLE 5 (Continued)

NAME BamHI BglI BglII EcoRI EcoRV


HSBT022 lPn(47) 
3.3/3.5 






2PH(47) 

5.B/2.5+3.2 

3PH(5] 

5.8/9.2 

4PM (8) 

?/2,I 




MSBT0Z3 


H5BT024 






IID(26J 
14/7.5 


lO/iO.7 


HSBT025 


mm) 


2P«(33) 
12/9.7 






WSBT026 






S/? 


!??(??) 


HSBT02e 


1ID[37) 
5/fc,3 

21W>) 
3/? 


1ID(3?) 
14/10 
21D(??) 
15/? 


lIDt37l 
13/6.6 


1ID{37) 
8.1/5.5 
2ID(??) 
3.4*5/? 


KSBT030 l?m7] 
8/5. s+: 


KSBT03I 








00(09) 


KSBT032 IPK{22) 
9.6^6/16 






2ID(U) 
?/6.2+7+12 


210(11) 
9/6.1 


HSBT033 






IPNdl) 

3.5/5 

210(44) 


2ID(44) 
6/7. B 

??(??) 



5.8/7 

3PH(44J 

9/12 



HSBT034 
TABLE 5 (Continued)

NAME HindIII KpnI MspI PstI PvuII

l(5BT022



HSBT023 



lPn{45) 

5.5+3,7/3.2+2 



K5BT024 iID(26) 
?/7.8 



1ID(261 
7.5/5 



110(26) 
13m 



HSBT025 ??(??) 



HSBT026 ? 



2??(??1 



H5BT028 



KSBT030 



HSBT031 ??(??) 



??(??) 



3??(??) 
3.3/3.1 



??(??) 



HSBT032 2ID{11) 

vn 

3P«(33) 
4.6/4.9 
4ID(56} 
4/3.4 



nS6T033 2ID(44) 
14/15 

??(??) 



4ID(66) 
6.5/2-5 



4ID(6&) 
1.7/4.3 



2ID(11) 
?/2.7 



??(??) 



HSBT034 



1ID(44) 
5.9/3.9 



??{??) 



1ID(441 
4.7/4.3 
2PH{22) 
5.3/4 
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TABLE 5 (Continued) 



NAME TaqI XbaI HET.



WSST022 5P«f44) 89 
?/3.2 

RSBT022 ??{??) 0 

K5BT022 0 



KSBT023 2P«(33) 45 

17/13 

H5BT024 miU) IID(2&) 26 

14/15 3.B/4.7 
KSBT024 ??f??l 0 



NSBT025 3PK(37) 76 
KSBT025 4PK(??) 0 



H55TC26 0 



HSBT028 37 
«5ETC2E 0 



KSBTOSO 2PK{11I 78 
5/5.2 

H5BTC3: ??{??! 0 



KSBTC32 4IDf56) 61 
4.1/4.3 

RSBT032 0 

HSBT032 0 



IISBT033 5PM(44) 2ID(44J 88 

4/3.4 5/6.9 
M5BT033 0 
??(??) 

JSBT033 0 



HSBT034 1ID(44) 56 
5.B/fl 

f(SBT034 3??(??) 0 
TABLE 5 (Continued)

NAME BamHI BglI BglII EcoRI EcoRV

1IDE55)
fc. 3/6,6



MSBT035 



«SST037 



HSBT03B 5PKt??J 



lIDCll) 
6.6/10.5 



lIDCin 
15.0/14.0 



lID(ll) 
7.5/6,4 



ilD(ll) 
7.5/6.4 



nSBT039 IPII(ll) 
9/9-6 

nSBT040 



mill) 

lB/i& 



HSBT041 



HSBT042 lPltl33) 
14/15 

2PK{33) 
6/2 



HSBT043 



1ID144) 
4.5/4.7 
2ID[55) 
3.7/4.3 



110(441 
3,2/3.6 



nSBT044 



HSBT045 nmW 210(891 2ID(89) 

4/3.4 3.9/6.5/3.5 4/4.B/3.5 



nSBT046 IID(78) 110176) 

3.5/3.6 3.5/3.6 



lfSBT047 IPHdl) 
£.3/7.3 
2PN(30) 
?/l0.5 
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TABLE 5 (Continued) 



NAME HindIII KpnI MspI PstI PvuII






2ID(??) 




2iDf??) 


mmi 






1P«£22) 
4.5/5 
4??f??) 
3.8/? 




mim iiDdiJ 

9.6/8.5 


110(11) 
6.0/5.5 


2PH(37) 
19/2.7+16 
4??(??) 
4.3/? 




IIDCU) 
6+B/2 


H5BT039 


nSBTWO 




2??(??} 








ftSBT042 


3^(44) 
7/? 


??(??) 






l(S3T043 1II)(44J 
6.5/6.9 
3Pfi(78) 
4.8/3.3 
4Pft(78) 
4.8/4.6 
9P«(??) 
3/2 




5PHt33) 
2.9/3.1 
6ID(33I 
3,7/3.9 




6IDC335 
?/2+2.I 
7PK(U) 
?/2.4 


K3BT044 0 








1ID(??) 

3.B5/3.9/3.95.. 


nS6T045 1I&(44) 
Ii/10.5 




1ID(44) 






KSBT046 










KSBTO*? 




3PK(llJ 







6.5+2.B/9.3 

mm 

19/14 
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TABLE 5 (Continued) 



NAME TaqI XbaI HET.



HSBT035 1ID(55) 
7.8/5,3 




55 


KSBT037 




22 


«SBT037 




0 


nSBTOZe lIDdl) 
5.5/? 

H5BT038 3PHE22) 
3.5/? 


lIDdll 

£.s/s.e 

6??(??) 


44 

0 


«3BT039 




11 


RSBT04C 




22 


HSBTOAi 




0 


KSBT042 ??(??) 


??(??) 


44 


HSBT042 




0 


USBT043 mm) 

3.3/2.9 
M3BT043 2ID(55) 
1.4/2.2 




99 
0 
0 


HSBT043 




D 


HSBT044 




0 


tJSBTa45 3PK(33) A*? 
11/12 


mim 2PN{33) 
4.9/5.2 

tmm imm 

2.9/1.7 




7B 
0 


14/21 

KS6T047 




44 

0 
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TABLE 5 (Continued) 

NAME BamHI BglI BglII EcoRI EcoRV

HSBTC48 



HSfiT049 






110(22) 
9/3.9+5.3 


IID(22) 
9.7/4 








mm) 

VB.2 




HBBTOSl 2P«(22) 
20/lE 




IPMU) 
7/14 






H5BT052 




lIDdl) 
17/U 




2I0(??) 


HSBT053 








1PH(??) 


MSBT054 


HSBTOS^ 




1ID(55] 


1ID(55I 
5.6/7 
2PH[22) 
12/5.5 


110(551 
5.7/7.3 


BSBT057 






lIDCUl 
?/B 


110(11) 
9/B.5 
2Pn(??) 
7/6-7 


BSBT059 






110(331 * 
1.8/3.4 


110(33) 
?/3.2 




imiu 

6/5. B 


110(11) 
5/6.5 


lIDCU) 
9.6/11 
2ID(55) 
6.4/9.6 


110(11) 

9.6/U.5 

210(55) 

3.5/6.6 

3PH(22) 

9.6/6.8 
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TABLE 5 (Continued) 



NAME HindIII KpnI MspI PstI PvuII


HSBT048 I ID (44) 1ID{44) 


110(44) 




1ID(44) 


6.2/5.9+3 27/15+7.8 


4.3/3.7+3.2 

n 1 1 1 / 

5.5/4.6 




6.5/9 




Jrn \ ! fl 






B.4/7 








HSBT050 2P«(??) 2ID133) 




3PHtn) 


2ID(33) 


?/L5 




3.2/2.8 


2+1,6/6.8+1.8 


HSBT051 


3P«(??) 






HSBT052 


1ID(??1 




lID(ll) 
12/10.5 


HSBT053 




2ID{44) 
?/6.S 


2ID{44) 
2.1/2.4 
3PK(??) 
11,5/11 


HSBT054 




lP«t22) 




MSBT056 






IID(55) 
3/2.7 


HSBT057 


nSBT059 


1ID(33) 
12/11 






RSBmO lID(ll) 
6.2/6.8 




4PM{??) 
2.7/1.4 




HSBT061 110(44} 
7+2.4/9.3 






110(441 
4.7/5.2 
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TABLE 5 (Continued) 



NAME TaqI XbaI HET.



HSBT04B 
K5B704B 


1ID(44) 
6.4/7 


55 
0 


HSBTW9 




22 


MSST050 2IDt33) 

2.5+2.7/2.4+2.9 




55 


5.2/6 
MSBTOSi 5P«(33) 
3.3/3.1 




7B 
0 


HSBTC52 lIDCll) 
B/7.B 




li 


HSBT053 ?? 




44 


HSBT053 




0 


HSBT054 2PM{11) 
4.A/7.4 


3PHt33» 
3.7/4,8 


33 


KBBTC56 lIDt55) 
2.5/4.7 

RSBT056 




66 

0 



KSBT057 3PK(n5 4PHt44) 

14/9.5 20/15.5+<.5 

MSBT057 



01059 2PHt6fc) 
7.8/6.6 

HSBT059 mim 



KSfiT060 
1^1060 
H5fiT060 
TABLE 5 (Continued)

NAME BamHI BglI BglII EcoRI EcoRV


M5BT062 








??(??} 


HSBT064 


MBBT065 




1ID(44) 
4.5/7 






HSBTOi? 








IIB(55) 
9/7.3 


H56T068 lID(ll) 
lB/11 






lIDdl) 
14/12 


ilDClU 
10/8.8 


MBBT069 2PK(33I 

16+3.5/19.5 




1PK(44) 
12.5/10.5 


3PH(ll) 
?/9 


4PH(22) 
9/6.4 . . 


HSBT070 


iP«[lI) 
?/3 




2ID(55) 
2.5/2.9 


2ID(55) 
2.7/3.2 


HSBT071 


KSBT072 




1ID(33} 
13/12 






HSBT074 


nSBT075 










HSBT07A 210(441 
6,2/4.B 




1ID{33) 
2.5/2 


210(44) 
2/2.B 


210(44) 
2.3/2.9 


nsBTore 






IPH(ll) 
7.6/6.7 


2PHtll) 
7,8/6.7 


MSBT079 r>m) 


MSBTOSO 




110(11} 
4.5+2/5.7+4.3 






KSBTOBi 




mm 




??(??) 
TABLE 5 (Continued)

NAME HindIII KpnI MspI PstI PvuII




HSET064 


lPfi{33) 
25/19 








MSBT0£5 




1II){44) 
5.2/4 






HSBT067 




IIB(55) 
3.9/3.5 






HSBTOiB 


HSBT069 








5P«f33J 
?/3.2 


WSBT070 3PJt(22) 

9.4/7.9+1,5 


4P«tlll 
13/10.7+2 






5P)1(44» 
8.7/9-5 


BSBT071 








iPR(??) 
?/8 


H5BT072 2PH{I1) 
9.5/6.2 




3.7/4 


3PH(??) 
?/5.4 


4PHf44) 
5,9/9 


KSBT07; 1FH(221 
4,6/5,3 




2PHt44) 

10.S/I2/9.5 

3PH[33) 

17/U.5 

4P«{441 

10.8/12/9.5 




5Pfi(33; 
3.3/5.7 



KSBT075 



BSBT076 



IISBT07B 3PJ!(11) 

?/6 



IISBT079 



1^080 UDdit iifiaii 

2/2.6+2.7 ll/B+2.8 



«SBT081 2f1I(33) 

2.2/2.7 
TABLE 5 (Continued)

NAME TaqI XbaI HET.


MSBT062 ??(??) 




0 






33 


HSBTOitS 1ID144) 


110(44) 
4.8/3.1 


44 


K3BT067 


2PMi??) 


55 


hSBTOfcB 2PK(33) 
3/1.6 




44 


K5BT069 




89 


M5BT070 &**H{^'>) 

llwiilVIV Up lit- .# 

?/3.7 




78 


KSBT071 2PK(U) 
15/11 


3PK£??) 


U 


HSBT072 


5Pft[in 
20/12 


78 


HSBT074 




66 


KSBT074 




0 


MSBT07; 




0 


K5BT075 IPHdll 
3.2/4,4 




U 


KSBT076 1 ID 133) 
2/3. B 




44 


K5BT078 


4PHt33) 
4/6.4 


44 


MSBT079 
MSBT079 


IPHCin 
11/13 
2Prt(33) 
3.5/3.7 


44 

0 


tISBTOSO 


2PK(n) 
6.3/3.9+2.5 


22 


I^BTOBI 




33 






WO 92/13102 PCT/US92/00340




TABLE 5 (Continued) 



NAME BamHI BglI BglII EcoRI EcoRV




«S»T083 2PKf44) 
11/17 


mill) 

?/4.5 




311) (33) 
5/3.8 


3ID(33) 
5.5/3.9 


flSBT084 1PH(44} 
4.5/9 


HSBT0B5 


iID(44) 
7.7/6.5 
2ID(22J 
3.3/4.B 






3ID(55) 
5.5/4. B 
2ID(22) 
6/4.5 


KSBT0B6 2ID122) 

10.5+2.2/12.5 




IPH(ll) 
6.2/6*9.4 




3PH(IU 
?/3.7 



RSBTOe? 



HSBTOBB 



nSBT0e9 



HSBT091 

5.5/6 



«5BT092 



KSfiT093 



liSBT094 



HSBT096 



«SBT097 



HSBT098 
TABLE 5 (Continued)

NAME HindIII KpnI MspI PstI PvuII




HSBro83 mm 


HSBT084 




2P!i[22) 
5.1/3.9 




HSBT085 2ID(22J 
12/5.7 


nSBT086 




2ID(221 4ID(55) 
18/21 2.5/5 
4IDC55) 
4.3/7 


5P«(55) 
7.1/5 


MSBTOB? 


MSBT08E 


KS6T0B9 


flSBTO90 


KSBT091 




??(??) 3IB133) 
?/4 




lfSBT092 1PHI4A) 
4.3/2.8 




2PH(22) 
3.5/4 




MSBT093 




IPR{33) 
4.7/1.9+2.5 
2PH(33) 
?/3.3 




ttS6T094 


KSBT096 


1P«(11) 
8/4-B+3.2 






HSBT097 1P«(??) 


KS3T09B 




lPn(66) 


2PK(??) 
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TABLE 5 (Continued) 



NAME TaqI XbaI HET.





HSBT083 5PK11I) 
?/4 




66 


fiSBT084 3PHC331 
2.3/2.4 

H5BT084 4P«t??) 
?/2.B 




55 
0 


HSBT085 3ID(55) 
5/3. B 

rtSBTOBS IID(44) 
?/2.7 


2IIK22> 
U+15/23 

— 


55 
0 


HSBT0B6 4ID(55) 
6/3.3 

H5BT0B& 




7B 
0 


HSBT037 




0 


nsBToas 




0 


ieBT0B9 




0 


ttssmo imn) 




11 


«3BT091 3IDt33} 

A/C D 


— 


33 


RSBTa92 3PH(33) 
5*3/6.8 




66 






44 


IISBT093 




0 


KSBT094 




0 


RSBT096 




11 


ISBT097 


2Pn(??) 


0 


HSBT098 3P«(55) 
4.5/4 


18/15 


66 
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TABLE 5 (Continued) 



NAME BamHI BglI BglII EcoRI EcoRV


«SBT09B 


MSBT099 1PH{33) 
14/15 






210(111 
10.5+4/15+3.9 


2ID{11) 
8+4/8.8+3.9 


H5BT100 


WSETlOl 2P«{22) 
8/10.2 


1PH(44) 
9+7/14 








HSBT1C3 


H5ET104 


HSBT1C5 






lPR(??i 




H5BT106 lIDt22) 
?/9.5 






1ID(22] 
?/B.3 


110(221 
?/8.5 


HSBT107 2P«{67) 

12+2.4/14.4 


1ID(331 
2» 6+2,8/3.1 








MSBT108 2PK(11J 
9.5/7.8 




IPHCll) 
15/6.5 






«SBT109 


KS3TI10 








1PH(44) 
10.3/9.1 


RSBTUl 






ipncii) 

?/3 




HSBT113 






1PH(22) 
10/9.5 




n5BTll4 1IDC33) 
?/4.4 






1ID(33) 
B.B/8+1.9 


1ID(33] 
5.5/4.8 
TABLE 5 (Continued)

NAME HindIII KpnI MspI PstI PvuII



KS3T098 



«SBT099 



2IDaiJ 

19/21 

3P«t78) 



2IDCU) 
2.1/1.9 



4.9/4.8 

4P«(??) 
?/4.2 



KS6T1O0 



ItSBTIOl 



RSBT103 



IPHCIII 
8/U.5 



3PK(55) 
3.55/3.6 



RSBT1C5 



2PK(??1 



ftSBTlOfc lIDt22} 
?/3,7 



K3BT107 



3??(??) 



HSBTIOE 



KSBTI09 nD(55J 

4.9+5.B/10.B 



WBTllO 



1ID(55} 
4.6/3-4 



HSBTUl 



2ID(Z2t 
2.9/3.2/2.1 



310(11} 
7.5/9.2 



RSBTU3 2W1(33) 


3ID(11) 




7.5/3.4+4.2 


?/10 




3Ifitil) 


4IDC55J 




4/7+3.8 


12+21/35 





KSBTli4 119(33) 
?/2.5 



1111(33} 
U/IO 



1ID(33} 
5.3/5.9+1.6 



1IDC33} 
?/4 
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TABLE 5 (Continued) 



NAME TaqI XbaI HET.



MSBT09B 4P«(n] 0 
2.5/2.B 



RSBT099 2ID11I) 78 
4.9/4.8 

HSBT099 5PH(22) 0 
2.5/1.5 



HSBTIOO 11 



HSBTlOl 4PHC33) 66 

18/5/7.3 

HSBTlOl 5PH(33) 0 

ie/5/7.3 



KBBT103 0 



HSBTi04 0 



hSBTlOS 0 



HSBTlOa 22 



M5BT107 1ID(331 78 
2.1/2.6 



HSBTIOB 3P«{50) 55 

7/U.5 



MSBT109 2P«{??J 55 



MSBTllO 2P«t??l 44 



HSBTlll 3IDIU) 22 
7/8.8 

HSBTlll 4P«[ll) 0 
3.7/4.9 



HSBT113 3ID(lll 66 
VI 

HSBT113 4ID(55) ^ 
6.S/6.9 



HSBTU4 I»t33) 55 

?/4.9 
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TABLE 5 (Continued) 



NAME BamHI BglI BglII EcoRI EcoRV



HSBTI14 2PH(22) 



HSBT1I9 



KSBri20 




imii) 






VI 


H5BT121 


lPKt33) 


2Ptt(33) 






5/5.4 
NAME HindIII MspI PstI PvuII



HSBTiU 3P«(22) 

e.7/io.e 



4PH(55) 
5/A.7/7.2 
5P«(55) 
5/4.7/7.2 



??{??) 



iP«(33) 
6.5/5-5 



HSBT119 1ID(55) 
11/5.2 



KSBT120 2PHIA4) 
7,5/6.1 



HSBT121 
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TABLE 5 (Continued) 



NAME TaqI XbaI HET.

H5BTU4 ^ 

HSBTIU ^ 



nSBTU9 il»t55) 
4.2/2.B 

HSBT119 2PH(44) 
6/7.7 



5SBT120 3PH(44) 
5.2/3.7 



HSBT121 3PH(44) 
3.4/5.2 
Although this way of classifying RFLPs gives a
conservative estimate of the number of identified
polymorphisms, for cosmids characterized by strong
linkage disequilibrium, the number of ID events may be
inflated at the expense of the actual number of PMs. To
compensate for this, we performed a second set of
calculations in which a polymorphic event must be
detected with at least 3 enzymes to qualify as ID.
RFLPs previously attributed to insertion-deletion events
because they were detected with two enzymes are now
considered as two independent point mutations. 24
polymorphisms initially considered IDs fell into this
category. Following this approach, 239 independent RFLPs
were identified, or 2.9 per cosmid, with now 87.9% of
the PM type and 12.1% of the ID type.
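The two classification rules and the resulting recount can be checked
with a short sketch. The counts are those reported in the text (162 PM,
53 ID, 24 IDs seen with exactly two enzymes, 82 polymorphic cosmids);
the function names and the threshold parameter are illustrative.

```python
def classify(n_enzymes, min_enzymes_for_id=2):
    """Label a polymorphic pattern by the number of enzymes revealing it."""
    return "ID" if n_enzymes >= min_enzymes_for_id else "PM"


def recount(pm, ids, two_enzyme_ids):
    """Apply the stricter rule: each two-enzyme ID becomes two PMs."""
    return pm + 2 * two_enzyme_ids, ids - two_enzyme_ids


new_pm, new_id = recount(pm=162, ids=53, two_enzyme_ids=24)
total = new_pm + new_id
print(new_pm, new_id, total)
print(round(100 * new_pm / total, 1), round(100 * new_id / total, 1))
print(round(total / 82, 1))
```

A pattern seen with two enzymes is therefore an ID under the first rule
(`classify(2)`) and a pair of PMs under the second
(`classify(2, min_enzymes_for_id=3)`).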

Table 5 reports the observed heterozygosities
obtained with the generated multisite haplotype systems.
These values correspond to the percentage of individuals
heterozygous for at least one of the polymorphisms
identified with a given cosmid. Notably, this
parameter is not affected by the mode of classification
of RFLPs into PMs or IDs. At this point, and without
segregation information, we cannot dissect the
heterozygous genotypes into their component haplotypes.
These heterozygosities were estimated on a small sample
and should therefore be considered cautiously. As
pointed out by Skolnick and White (129), the main
advantage of working with a sample of 9 individuals is
that it is sufficient to identify the majority of useful
polymorphisms. However, the mean heterozygosity of
51.9%, obtained over the 84 polymorphic cosmids,
demonstrates the power of the approach.
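The heterozygosity defined above, the percentage of individuals
heterozygous for at least one polymorphism of a cosmid, can be sketched
as follows. The genotype matrix is hypothetical; with 9 individuals the
statistic can only take multiples of 1/9, which is in line with the
whole-number percentages listed in the HET. column of Table 5.

```python
def observed_heterozygosity(genotypes):
    """Percentage of individuals heterozygous at one or more RFLPs.

    genotypes: one list per individual, holding True for each RFLP of
    the cosmid at which that individual is heterozygous.
    """
    heterozygous = sum(1 for individual in genotypes if any(individual))
    return 100.0 * heterozygous / len(genotypes)


# Hypothetical cosmid with two RFLPs scored on 9 individuals.
sample = [
    [True, False], [False, True], [True, True], [False, False],
    [False, False], [True, False], [False, False], [False, False],
    [False, False],
]
print(round(observed_heterozygosity(sample)))
```

Because an individual counts once however many sites are heterozygous,
splitting or merging RFLPs between the PM and ID classes leaves the
value unchanged.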

The following numbers of RFLPs were detected by
each enzyme, irrespective of PM or ID type: TaqI: 57,
EcoRI: 37, MspI: 33, HindIII: 31, PvuII: 30, EcoRV: 29,
BamHI: 26, BglII: 24, XbaI: 21, PstI: 18, KpnI: 16 and
BglI: 13. The numbers of polymorphisms detected with
BglII and PstI were corrected to adjust for the lower
number of probes used with these enzymes.

Proper interpretation of the polymorphic patterns
has been confirmed by segregation analysis in pedigree
material for most of the described RFLPs (see
hereafter).

RFLPs of the PM type were used to calculate
nucleotide diversities as described in Materials and
Methods. Two sets of values are reported, depending on
which of the two criteria was used to classify an RFLP
into the PM or ID type. Global nucleotide diversities
of respectively 0.000652 and 0.000846 were obtained,
meaning that a randomly selected Holstein animal will be
heterozygous at approximately 1 in every 1200 to 1500
base pairs. As expected because of the presence of a
hypermutable cytosine followed by guanine in their
recognition sequence, nucleotide diversities more than
twice as high are obtained when combining data obtained
with the enzymes MspI and TaqI: 0.001493 and 0.002239
respectively (5). On the other hand, the recognition
sequences of the enzymes BglII, HindIII, PstI, PvuII and
XbaI are devoid of hypermutable cytosines in the CpG
dinucleotide and yield combined nucleotide diversities
less than one third of the values found with MspI and
TaqI (0.000492 and 0.000648). Using these two sets of
values, one can extrapolate what nucleotide diversities
would be obtained if sampling hypothetical sequences
composed entirely of hypermutable cytosines, giving
respectively 0.004496 and 0.007012. Assuming that the
majority of detected polymorphisms behave according to
the neutral mutation-random drift hypothesis, nucleotide
diversity and mutation rate are simply related as:

$$\pi = 4 N_e \mu,$$

where Ne stands for the effective population size and
μ for the mutation rate (1). Therefore, our data allow us
to estimate that cytosines followed by guanines mutate
at a rate approximately 10 times higher compared to
other nucleotides, presumably because a substantial
fraction of these are methylated in the germline and
prone to mutate to thymidine by spontaneous deamination.
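The extrapolation and the resulting mutation-rate ratio can be retraced
numerically. The sketch below treats the diversity seen with MspI and
TaqI as a mixture of hypermutable and ordinary positions; the mixing
fraction of 1/4 is an assumption, chosen here because it reproduces the
extrapolated values quoted above, and is not stated in the text. Since
π = 4Neμ with the same Ne for both classes of site, the ratio of
diversities equals the ratio of mutation rates.

```python
def extrapolate_hypermutable(pi_cpg_enzymes, pi_other_enzymes,
                             hypermutable_fraction=0.25):
    """Diversity of a sequence made only of hypermutable cytosines.

    Solves pi_cpg = f * pi_hyper + (1 - f) * pi_other for pi_hyper,
    where f is the assumed fraction of hypermutable positions.
    """
    f = hypermutable_fraction
    return (pi_cpg_enzymes - (1.0 - f) * pi_other_enzymes) / f


# (MspI/TaqI diversity, CpG-free enzyme diversity) under the two
# PM/ID classification criteria, as reported in the text.
for pi_cpg, pi_other in ((0.001493, 0.000492), (0.002239, 0.000648)):
    pi_hyper = extrapolate_hypermutable(pi_cpg, pi_other)
    ratio = pi_hyper / pi_other  # equals the mutation-rate ratio
    print(round(pi_hyper, 6), round(ratio, 1))
```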

DISCUSSION 

We demonstrate in this work that large numbers of
DNA markers with very acceptable Polymorphism
Information Content can be quickly generated using
large, randomly selected genomic probes in Southern blot
hybridization experiments. The multisite haplotypes
identified in this study using cosmid probes have a mean
heterozygosity of 51.9%. This value is of the same order
of magnitude as the heterozygosities that we have
obtained with a panel of approximately 40 bovine
Variable Number of Tandem Repeat markers (mean
heterozygosity 59%; 150), and with more than 50 bovine
(TG)-dinucleotide microsatellites (mean heterozygosity
56%; unpublished).

A remarkably high proportion of the tested cosmids
proved informative: 74.5% of all tested clones, and as
high as 85% when considering only the clones giving
readable patterns. Compared to strategies aimed at
isolating hypermutable sequences such as mini- or
microsatellites, very little time and effort is wasted
on candidate clones which have to be dropped at a
later stage.

Because the cosmid clones used as probes are
selected at random, we can reasonably assume that the
coverage obtained with the generated markers is fairly
uniform. Monte-Carlo simulations allow us to estimate
that these 82 markers cover 29%, 47% or 60% of
the bovine genome in linkage studies if a maximum of
respectively 5, 10 and 15 cM is scanned on each side of
each marker, and assuming a total bovine map length of
25 Morgan as deduced from chiasmata counts (151).
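A Monte-Carlo estimate of this kind can be sketched as follows. The
sketch is not the simulation used in this work: it assumes 30
chromosomes of equal length (the relative chromosome lengths are not
given here), markers placed uniformly at random, and coverage defined
as the fraction of the map lying within the stated distance of at least
one marker. Its output should therefore only approximate the figures
quoted above.

```python
import random


def covered_fraction(n_markers, reach_cm, chrom_lengths_cm, rng):
    """Fraction of the map within reach_cm of at least one random marker."""
    per_chrom = [[] for _ in chrom_lengths_cm]
    for _ in range(n_markers):
        # Pick a chromosome with probability proportional to its length,
        # then a uniform position along it.
        idx = rng.choices(range(len(chrom_lengths_cm)),
                          weights=chrom_lengths_cm)[0]
        per_chrom[idx].append(rng.uniform(0.0, chrom_lengths_cm[idx]))
    covered = 0.0
    for length, positions in zip(chrom_lengths_cm, per_chrom):
        reached_to = 0.0  # right edge of the stretch already counted
        for pos in sorted(positions):
            start = max(pos - reach_cm, reached_to)
            end = min(pos + reach_cm, length)
            if end > start:
                covered += end - start
                reached_to = end
    return covered / sum(chrom_lengths_cm)


def mean_coverage(n_markers, reach_cm, chrom_lengths_cm,
                  replicates=200, seed=1):
    """Average covered fraction over independent random placements."""
    rng = random.Random(seed)
    runs = [covered_fraction(n_markers, reach_cm, chrom_lengths_cm, rng)
            for _ in range(replicates)]
    return sum(runs) / replicates


# Assumed layout: 25 Morgan split evenly over 30 chromosomes.
EQUAL_CHROMOSOMES = [2500.0 / 30] * 30
for reach in (5, 10, 15):
    print(reach, round(mean_coverage(82, reach, EQUAL_CHROMOSOMES), 2))
```

Clipping each interval at the chromosome ends is what makes the
simulated coverage fall slightly below the value expected for a single
unbroken map of the same total length.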
The Southern blot hybridization procedure used for the
detection of these RFLPs is a very mature and robust
methodology, allowing the treatment of very large
numbers of samples simultaneously, and benefitting from
intrinsic "multiplexing" properties, especially when
using nylon membranes. Indeed, and despite variations
between batches, we routinely use membranes for 10
or more hybridization cycles.

The main disadvantage of multisite haplotypes is
the requirement to use several restriction enzymes to
fully exploit their PIC. This increases costs and the
amounts of DNA required, and complicates the
organization of genotype collection and its subsequent
use in linkage analysis.

The fact that 75 to 85% of the cosmids tested in
cattle reveal polymorphism compares favorably with
results previously reported in the human. Schumm et al.
(131), for instance, report that 30% of the 1664 lambda
clones they tested in a sample of 5 individuals gave
polymorphic patterns. Adjusting for the sample size and
a ratio of approximately 2.5 between cosmid and phage
insert size, the two figures are probably fairly
similar. Surprisingly, only 54 of 101 human cosmid
clones tested by the same group (152) revealed
RFLPs when tested with 9 restriction enzymes, versus
74.5% in our study with, however, 12 enzymes. It has
been speculated that the relatively low level of
polymorphism found in that study might result from a
bias against human methylated sequences (including CpG
present in the recognition sequence of TaqI and MspI)
when constructing the cosmid library, due to the active
modified cytosine restriction system (mcr) of the
E. coli 1046 host (152).

These results are quite unexpected. Indeed,
because of the population structure imposed by breeding
strategies, the effective population size, Ne, in cattle
is expected to be significantly lower than in the human.
In American Holstein, Ne is estimated between 10^ and
2x10^ (Ina Hoeschele, personal communication). This
value has to be compared with an estimated Ne of 10* in
the human (18). Assuming identical mutation rates, this
reduction of Ne should be accompanied by a concomitant
reduction in genetic variation. As a matter of fact, we
obtain estimates of global nucleotide diversity between
0.000652 and 0.000846, which are between 3.5 and 2.5
times lower than values typically found in human
populations (2, 119, 132). This confirms our previous
results in another cattle population: the Belgian Blue
Cattle breed (3, 4).

At least part of the discrepancy between an
apparently reduced nucleotide diversity but similar RFLP
frequency may be accounted for by the apparently higher
frequency of insertion-deletion events found in cattle
compared to human. Schumm et al. (131) report that 58
out of the 515 polymorphic loci (11.26%) show
insertion-deletion type RFLPs; other groups report even
lower frequencies of such events in the human (R. White,
personal communication). In cattle, we found that 29
(35%) to 46 (56%) out of 82 polymorphic cosmids show
such insertion-deletion events, depending on whether an
ID-type polymorphism has to be detected with at least
three or at least two enzymes. These results seem to
point towards a fundamentally different property of both
genomes. It is tempting to speculate that this high
level of insertion-deletions in the bovine genome
reflects the activity of a mobile element. Analysis of
the restriction patterns characterizing these ID events,
however, does not reveal any typical, recurrent
"signature" of such an element.

Altogether, our laboratory has now isolated more
than 200 DNA markers for cattle with a mean
heterozygosity above 50%: 82 multisite haplotypes, 40
Variable Number of Tandem Repeat markers (150) and more
than 80 dinucleotide microsatellites (unpublished). The
coverage of the bovine genome obtained with increasing
numbers of randomly selected probes (0 to 500) was
estimated by Monte-Carlo simulation, assuming that the
family material used is sufficient to detect linkage at
respectively 5, 10 and 15 cM, and a total bovine map
length of 25 Morgan as deduced from chiasmata counts
(151), divided over the 30 bovine chromosomes according
to their relative length. With the PIC characterizing
our marker set, we feel fairly comfortable that in the
majority of situations we will be able to cover genetic
distances of the order of 10 cM or more, especially if
applying multilocus or interval mapping techniques (66,
68). Therefore our panel of probes should cover around
75% of the bovine genome.

It is obvious that we are approaching a point where
the efficiency of a strategy based on the further
accumulation of random markers becomes questionable.
After 200 probes, the additional coverage obtained per
new marker, expressed as a fraction of the maximum
coverage possible, is now approximately 1/5 of the
coverage obtained when we started this project. This
creates the need for more targeted approaches. In this
regard, mappers of domestic animal genomes will benefit
from the human mapping efforts and the remarkable
chromosomal conservation observed within mammals (133).
Based on comparative mapping information, it should be
possible to identify genes likely located in the "holes"
left by the random approach and to generate multisite
haplotypes or microsatellite markers around their bovine
homologues. We are presently exploring the feasibility
of such approaches.

With the markers available today, a substantial
part of the bovine genome is now amenable to linkage
scanning, which will hopefully allow the mapping of
Economic Trait Loci and testing the feasibility of
Marker Assisted Selection schemes.
EXAMPLE 3

CLONING, CHARACTERIZATION AND "IN VITRO" AMPLIFICATION
OF BOVINE MICROSATELLITES

INTRODUCTION

Recently, microsatellites were proven to be an
abundant source of highly polymorphic markers in the
human (32-34). As their name implies, microsatellites
are minute VNTR markers (18-20), characterized by tandem
repetitions of very short repeats, one to four base
pairs in length. Microsatellites exhibit levels of
polymorphism comparable to VNTRs, but are much more
abundant and apparently evenly spread throughout the
genome.

We describe the cloning, characterization and "in
vitro" amplification of more than 100 such bovine
microsatellites.

MATERIALS AND METHODS 

1. DNA Database Search 

Bovine and ovine sequences in the EMBL and GenBank
(version 64.0) databases were searched for all types of
dinucleotide and trinucleotide repeats using the
Intelligenetics software, release 5.37. The minimum
number of repeats was set at six. Six bovine sequences,
characterized by the longest microsatellites, were
retained for further analysis and are listed in Table 6.
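
A search of this kind can be approximated with a regular expression. The sketch below is not the Intelligenetics software; it is a minimal stand-in that reports every dinucleotide or trinucleotide unit repeated at least six times in tandem, and it assumes the input is a plain string of A, C, G and T. The function name find_repeats is hypothetical.

```python
import re


def find_repeats(seq, unit_sizes=(2, 3), min_repeats=6):
    """Return (start, unit, n_repeats) for each tandem repeat found in seq."""
    seq = seq.upper()
    hits = []
    for k in unit_sizes:
        # A k-base unit followed by at least (min_repeats - 1) copies of itself.
        pattern = re.compile(r"([ACGT]{%d})\1{%d,}" % (k, min_repeats - 1))
        for match in pattern.finditer(seq):
            unit = match.group(1)
            if len(set(unit)) == 1:
                continue  # skip homopolymer runs such as AAAAAA...
            hits.append((match.start(), unit, len(match.group(0)) // k))
    return hits


if __name__ == "__main__":
    example = "GGATC" + "TG" * 8 + "CCATA"
    print(find_repeats(example))
```

Ranking the hits by repeat number would then single out the longest microsatellites in each database entry.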

TABLE 6

GBCYP21     GBIRBP
GBFSH       GBKCAS
GBGAPR      GBPRLGR
2. Isolation of Bovine Microsatellites

Bovine genomic DNA was digested to completion with
Mbo I and size-fractionated by agarose-gel electrophore-
sis. Fragments between 250 and 500 base pairs were
recovered and purified using "Gene-Clean", ligated into
the BAP-dephosphorylated BamH I site of pUC13 (Pharma-
cia), and cloned into E. coli DH5α cells (BRL). The
resulting clones were screened for the presence of (TG)n
microsatellites using a 32P-kinased (AC)15 oligonucleotide
as probe, and for (AG)n microsatellites using a (TC)15
probe. The library was made with female DNA to avoid
the previously characterized Y-specific TG-rich bovine
DYZ1 sequence (117).

3. Sequencing of Bovine Microsatellites

Positive clones were sequenced using one of the
following procedures:

(a) Plasmid DNA was prepared using standard "boil-
ing mini-prep" procedures and subjected to two chain-
termination sequencing reactions using unmodified T7 DNA
polymerase (Pharmacia), with the "universal" and "re-
verse" sequencing primers, respectively. The 35S-labeled
sequencing products were analyzed on standard denaturing
polyacrylamide sequencing gels and detected by auto-
radiography.

(b) Magnetic solid-phase sequencing (137). Alterna-
tively, positive colonies were grown in microtiter
format using standard procedures in order to establish
glycerol stocks, and 5 µl of culture was directly subjected
to PCR amplification using the following vector-specific
primers:

UNIBIS: 5'-GATGTGCTGCAAGGCGATTAAGTTG-3'
REVBIS: 5'-CGGCTCGTATGTTGTGTGGAATTGT-3'

Two 30-cycle amplifications were carried out per clone,
one with the UNIBIS primer biotinylated, the other with
the REVBIS primer biotinylated. Denaturation was at 93°C
for 1 min (except for the first cycle: 95°C for 5 min),
annealing at 60°C for 2 min, and extension at 72°C for
2 min. All of the PCR reactions were performed in micro-
titer format using the TECHNE MW-2 heating device. The
biotinylated strand of the PCR product was captured
using the DYNAL streptavidin-coated magnetic beads
according to the manufacturer's recommended conditions and
sequenced using unmodified T7 DNA polymerase (Pharmacia)
as specified by the manufacturer.

4. Amplification and Detection 
of Bovine Microsatellites 

(a) Simplex Amplification. The generated sequences
are organized in the following way:

5'-.....(UP).....(TG)n-3'
3'-(AC)n.....(DN).....-5'

Suitable primers for in vitro amplification are identi-
fied in the "UPSTREAM (UP)" and "DOWNSTREAM (DN)" strands
using the "OPTIPRIM" program designed by one of us.
Given sequence information flanking a DNA stretch,
"Optiprim" helps the user to identify suitable primer
pairs for PCR amplification of the corresponding DNA
stretch.

Description of the program: The two DNA sequences
flanking the DNA stretch to be amplified are referred to
as the upstream (UP) and downstream (DN) sequence, res-
pectively. Both for UP and DN, Optiprim tests all
possible primers of given length (as defined by the
user) and retains the primers (1) with melting tempera-
ture (Tm) within the range defined by the user (Tm is
calculated as 2°C x number of As or Ts + 4°C x number of
Gs or Cs); (2) with a minimum percentage of each nucleo-
tide as defined by the user; and (3) which cannot form
secondary structures, as judged from the hydrogen bonds
that can be formed between two molecules of the same
primer when sliding them in antiparallel orientation
against each other, as illustrated in the following:

5'-PRIMER-3' >
   < 3'-REMIRP-5'

An A facing a T contributes two hydrogen bonds, and
a G facing a C contributes three hydrogen bonds. No loop
formation is considered when performing this analysis.
This generates two sets of selected primers: an UP set
and a DN set. All possible pairs of one UP and one DN
primer are then tested. Optiprim retains the primer
pairs if (1) the difference between melting temperatures
of the two primers is within a range defined by the
user, and (2) the two primers cannot form secondary struc-
tures, determined as for individual primers, except that
now the UP primer is slid against the DN primer. Using
this program, 80% of the selected primer pairs gave
successful PCR amplification in our microsatellite
systems. The following criteria are considered by
"OPTIPRIM" when searching for primers: primer length,
melting temperature and secondary structures that can be
formed within and between primers. The selected primers
are synthesized by phosphoramidite chemistry on Applied
Biosystems synthesizers and used without further puri-
fication. The microsatellites are amplified in vitro,
in microtiter plates using the Techne MW2 device, under
the following conditions (typically, 30 µl reactions):
Target DNA          50 ng-100 ng
KCl                 50 mM
Tris-HCl, pH 8.4    10 mM
MgCl2               1.5 mM
Gelatine            0.01%
dNTP                200 µM each
Primers             1 µM each
32P-dCTP            2 µCi/30 µl
Amplitaq            1 U/30 µl
Thirty-cycle amplifications are performed, characterized
by a 93°C denaturation for 1 min (except for the first
cycle: 95°C, 5 min), annealing at 55°C, 60°C or 65°C for
2 min depending on the primers, and extension at 72°C
for 2 min. Annealing temperatures are reduced by 5°C
when using bovine primers on ovine target DNA.
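
The primer selection rules given above under "Description of the program" can be sketched as follows. This is not the OPTIPRIM source code, which is not reproduced here; it is a minimal illustration under stated assumptions: candidate sequences are written 5' to 3' as they would be synthesized, the secondary-structure test simply sums the hydrogen bonds (A-T = 2, G-C = 3) at each antiparallel offset and compares the maximum with a user-chosen limit, and all function names and threshold parameters are hypothetical.

```python
H_BONDS = {("A", "T"): 2, ("T", "A"): 2, ("G", "C"): 3, ("C", "G"): 3}


def wallace_tm(primer):
    """Tm = 2 x (number of A or T) + 4 x (number of G or C), in degrees C."""
    p = primer.upper()
    return 2 * (p.count("A") + p.count("T")) + 4 * (p.count("G") + p.count("C"))


def max_sliding_bonds(primer_a, primer_b):
    """Largest hydrogen-bond total over all antiparallel offsets (no loops)."""
    top = primer_a.upper()            # written 5'->3'
    bottom = primer_b.upper()[::-1]   # second molecule written 3'->5'
    best = 0
    for shift in range(-(len(bottom) - 1), len(top)):
        bonds = 0
        for j, base in enumerate(bottom):
            i = shift + j
            if 0 <= i < len(top):
                bonds += H_BONDS.get((top[i], base), 0)
        best = max(best, bonds)
    return best


def candidate_primers(flank, length, tm_range, min_pct, max_self_bonds):
    """All windows of the flank that pass the Tm, composition and self-pairing tests."""
    flank = flank.upper()
    kept = []
    for start in range(len(flank) - length + 1):
        p = flank[start:start + length]
        if not tm_range[0] <= wallace_tm(p) <= tm_range[1]:
            continue
        if any(100.0 * p.count(b) / length < min_pct for b in "ACGT"):
            continue
        if max_sliding_bonds(p, p) > max_self_bonds:
            continue
        kept.append(p)
    return kept


def primer_pairs(up_set, dn_set, max_tm_diff, max_cross_bonds):
    """UP/DN combinations with similar Tm and limited cross-pairing."""
    return [(u, d) for u in up_set for d in dn_set
            if abs(wallace_tm(u) - wallace_tm(d)) <= max_tm_diff
            and max_sliding_bonds(u, d) <= max_cross_bonds]
```

In use, candidate_primers would be called once on the UP flank and once on the DN flank, and the two resulting sets passed to primer_pairs; suitable limits for the bond totals would have to be chosen empirically.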

(b) Multiplex Amplification. When performing multi-
plex amplifications, concentrations of KCl, Tris-HCl,
MgCl2, gelatine and dNTPs are increased by 50%, while

10 the primer concentrations are decreased to leOfM each* 
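
As a worked example of the 50% increase, the multiplex buffer concentrations can be derived from the simplex conditions listed earlier. The sketch below is only arithmetic on those listed values; the dictionary keys and function name are hypothetical, and the primer concentration is left out because it is adjusted separately.

```python
# Simplex conditions as listed for the 30 µl reactions.
SIMPLEX = {
    "KCl_mM": 50.0,
    "Tris_HCl_mM": 10.0,
    "MgCl2_mM": 1.5,
    "gelatine_pct": 0.01,
    "dNTP_uM_each": 200.0,
}


def multiplex_conditions(simplex, factor=1.5):
    """Scale every buffer component by the same factor (1.5 = +50%)."""
    return {name: value * factor for name, value in simplex.items()}


if __name__ == "__main__":
    for name, value in multiplex_conditions(SIMPLEX).items():
        print(name, round(value, 4))
```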

(c) Detection of Microsatellite Products. 2 µl of
PCR reaction are mixed with the same volume of formamide
dye and run in a denaturing 7% acrylamide, 32% forma-
mide, 5.6 M urea, 13.5 mM Tris, 4.5 mM boric acid, 250
µM EDTA gel. 32P-labeled products are detected by auto-
radiography.



RESULTS AND DISCUSSION 

1. Cloning and Sequence Characterization of Bovine
Microsatellites

A library of Mbo I fragments between 250 and 500 bp
was screened with the oligonucleotide probes. One out
of 50 clones cross-hybridized. Assuming independent
distribution of microsatellites and Mbo I sites, the
number of (TG)n microsatellites in the bovine genome
is estimated to be >100,000. Table 7 summarizes the
sequence information available for about 230 such bovine
microsatellites. For each of these, sufficient sequence
information has been gathered to generate the required
primers for PCR amplification of the corresponding
microsatellite. All sequences were generated by
sequencing as described above or by screening GenBank.
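
The order of magnitude of this estimate can be checked with a short calculation. The sketch below assumes a mean insert size of 375 bp (the midpoint of the 250 to 500 bp fraction) and a haploid genome of roughly 3 x 10^9 bp; the genome size is an assumption introduced for illustration and is not stated in the text.

```python
def estimate_repeat_count(positive_fraction, mean_insert_bp, genome_bp):
    """Number of repeats expected if one occurs every (insert / fraction) bp."""
    bp_per_repeat = mean_insert_bp / positive_fraction
    return genome_bp / bp_per_repeat


# One positive clone in 50, inserts of about 375 bp, genome of about 3e9 bp (assumed).
ESTIMATE = estimate_repeat_count(1 / 50, (250 + 500) / 2, 3e9)

if __name__ == "__main__":
    print(round(ESTIMATE))
```

Under these assumptions one microsatellite is expected roughly every 19 kb, which is consistent with the figure of more than 100,000 given above.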



09/30/2004, EAST Version: 1.4.1 



wo 92/13102 



-98- 



PCr/US92/00340 



TABLE 7
Bovine Microsatellites

Sequence Identification Numbers

Name         Up  Repeat    Down
AGLA13        1       2       3
AGLA17        4       5       6
AGLA206       7       8       9
AGLA209      10      11      12
AGLA212      13      14      15
AGLA215      16      17      18
AGLA217      19      20      21
AGLA218      22      23      24
AGLA22       25      26      27
AGLA220      28      29      30
AGLA223      31      32      33
AGLA226      34      35      36
AGLA227B     37      38      39
AGLA230      40      41      42
AGLA232      43      44      45
AGLA233      46      47      48
AGLA234      49      50      51
AGLA243      52      53      54
AGLA247      55      56      57
AGLA248      58      59      60
AGLA254      61      62      63
AGLA255      64      65      66
AGLA257      67      68      69
AGLA258      70      71      72
AGLA259      73      74      75
AGLA260      76      77      78
AGLA267      79      80      81
AGLA269      82      83      84
AGLA272      85      86      87
AGLA280      88      89      90
AGLA284      91      92      93
AGLA285      94      95      96
AGLA29       97      98      99
AGLA291     100     101     102
AGLA293     103     104     105
AGLA296     106     107     108
AGLA298     109     110     111
AGLA299     112     113     114
AGLA300     115     116     117
AGLA33      118     119     120
AGLA8       121     122     123
TABLE 7 (Continued)




/-* T-% -r*1 O n 

GBFSH 






IOC 


GBIRBP 


127 


1 0 ft 


1 0 Q 


GBKCAS 


n 0 rt 

loU 


J- JX 




GBPRLGR 


1 Jo 








xo 0 


137 






139 


140 


141 






1 4*^ 




MGTG13B 


1 A K. 




1/1*7 


MGTG3       148     149     150
MGTG4B      151     152     153
MGTG7       154     155     156
TGLA10      157     158     159
TGLA102     160     161     162
TGLA109     163     164     165
TGLA110     166     167     168
TGLA111     169     170     171
TGLA112     172     173     174
TGLA116     175     176     177
TGLA117     178     179     180
TGLA12      181     182     183
TGLA122     184     185     186
TGLA123     187     188     189
TGLA124     190     191     192
TGLA125     193     194     195
TGLA126     196     197     198
TGLA127     199     200     201
TGLA128     202     203     204
TGLA13      205     206     207
TGLA130     208     209     210
TGLA131     211     212     213
TGLA132     214     215     216
TGLA134     217     218     219
TGLA135     220     221     222
TGLA137     223     224     225
TGLA141     226     227     228
TGLA142     229     230     231
TGLA147     232     233     234
TGLA149     235     236     237
TGLA15      238     239     240
TGLA153     241     242     243
TABLE 7 (Continued)

Name         Up  Repeat    Down
TGLA154     244     245     246
TGLA158     247     248     249
TGLA159     250     251     252
TGLA160     253     254     255
TGLA162     256     257     258
TGLA164     259     260     261
TGLA17      262     263     264
TGLA170     265     266     267
TGLA171     268     269     270
TGLA172     271     272     273
TGLA175     274     275     276
TGLA176     277     278     279
TGLA179     280     281     282
TGLA182     283     284     285
TGLA188     286     287     288
TGLA189     289     290     291
TGLA2       292     293     294
TGLA20      295     296     297
TGLA203     298     299     300
TGLA206     301     302     303
TGLA208     304     305     306
TGLA210     307     308     309
TGLA213     310     311     312
TGLA214     313     314     315
TGLA215     316     317     318
TGLA22      319     320     321
TGLA222     322     323     324
TGLA226     325     326     327
TGLA227     328     329     330
TGLA23      331     332     333
TGLA231     334     335     336
TGLA245     337     338     339
TGLA25      340     341     342
TGLA254     343     344     345
TGLA255     346     347     348
TGLA257     349     350     351
TGLA26      352     353     354
TABLE 7 (Continued)





•JyJ-J 


U«J o 






•jdo 


R Q 


J bu 




oox 




Obs3 




Jo4 


Jbo 


o ^ c 
366 


TGLA^ DO 


ob / 


'3 /TO 


'5 t Q 


TGLAZV 


•3 T A 


o / 1 


372 


TGLA272 


TT3 
J / J 


•3 T /* 


375 


»nr*T jv o o 


o /b 




378 


TG1jA3 


37y 


^ o r> 


381 


TGLA301     382     383     384
TGLA303     385     386     387
TGLA304     388     389     390
TGLA306     391     392     393
TGLA307     394     395     396
TGLA309     397     398     399
TGLA31      400     401     402
TGLA310     403     404     405
TGLA311     406     407     408
TGLA318     409     410     411
TGLA322     412     413     414
TGLA323     415     416     417
TGLA325     418     419     420
TGLA327     421     422     423
TGLA328     424     425     426
TGLA332     427     428     429
TGLA334     430     431     432
TGLA337     433     434     435
TGLA339     436     437     438
TGLA34      439     440     441
TGLA340     442     443     444
TGLA341     445     446     447
TGLA342     448     449     450
TGLA345     451     452     453
TGLA346     454     455     456
TGLA35      457     458     459
TGLA351     460     461     462
TGLA353     463     464     465
TGLA354     466     467     468
TGLA357     469     470     471
TGLA36      472     473     474
TGLA37      475     476     477
TGLA377     478     479     480
TGLA378     481     482     483
TGLA380     484     485     486
TGLA381     487     488     489
TGLA382     490     491     492
TGLA387     493     494     495
TGLA39      496     497     498
TGLA394     499     500     501
TGLA4       502     503     504
TGLA40      505     506     507
TGLA400     508     509     510
TGLA414     511     512     513
TGLA415     514     515     516
TGLA417     517     518     519
TGLA419     520     521     522
TGLA420     523     524     525
TGLA421     526     527     528
TGLA423     529     530     531
TGLA424     532     533     534
TGLA427     535     536     537
TGLA429     538     539     540
TGLA431     541     542     543
TGLA432     544     545     546
TGLA433     547     548     549
TGLA435     550     551     552
TGLA436     553     554     555
TGLA437     556     557     558
TGLA438     559     560     561
TGLA44      562     563     564
TGLA441     565     566     567
TGLA443     568     569     570




571 


572 


R7? 








o / 0 




u 1 f 


J / D 


R7 Q 




<j 0 u 


^ 0 X 




TGLA47      583     584     585
TGLA48      586     587     588
TGLA49      589     590     591
TGLA5       592     593     594
TGLA51      595     596     597
TGLA52      598     599     600
TGLA53      601     602     603
TGLA54      604     605     606
TGLA58      607     608     609
TGLA6       610     611     612
TGLA60A     613     614     615
TGLA60B     616     617     618
TGLA61      619     620     621
TGLA66A     622     623     624
TGLA67      625     626     627
TGLA68      628     629     630
TGLA69      631     632     633
TGLA70A     634     635     636
TGLA70B     637     638     639
TGLA72      640     641     642
TGLA73      643     644     645
TGLA75      646     647     648
TGLA76      649     650     651
TGLA77      652     653     654
TGLA78      655     656     657
TGLA79      658     659     660
TGLA8       661     662     663
TGLA80      664     665     666
TGLA82      667     668     669
TGLA84      670     671     672
TGLA85      673     674     675
TGLA86      676     677     678
TGLA89      679     680     681
TGLA9       682     683     684
TGLA94      685     686     687
TGLA98      688     689     690
TGLA99      691     692     693
TGLB84      694     695     696
2. PCR-Amplification and Detection of
Microsatellites

(a) Simplex Amplification

Table 8 reports a preliminary list of bovine micro-
satellite systems that were successfully amplified in
vitro, with the corresponding primer pairs. Note that
pairs of primers selected by "OPTIPRIM" allow success-
ful amplification in at least one of our standard con-
ditions more than 80% of the time. Table 8 also gives
the favoured annealing temperature (using the TECHNE MW2
heating device). The mean heterozygosity for the bovine
microsatellites was estimated at approximately 50%.
TABLE 8

PCR Amplified Bovine Microsatellites

Name     Up Primer   ID   Down Primer ID   Annealing (°C)








fXUXinX X 


Q 


55 


60 








A(^LA2 0 fiDNI 


700 


55 


60 






701 


AlJX//»^ V ^x/ X 


709 


55 


60 








ACT.A91 


70A 


60 




Ad A91 "7 


ACT A"5 1 ^TTPI 




ACT.A91 TnMi 
AVjXviZ X / i-IlN X 


70f; 


55 


60 




AfiT.A97TTPT 

AVj^jftA ^ UIT J. 


707 


APJT.A^^nTJl 
AVjXin^ ^ x^iii X 


70fl 


60 






A{t;T.A'>'? KTTP1 


70Q 


ACT.A99 f^nWI 


710 

1 XVJ 


55 


60 




Al^T.A9'^4rTPl 


711 


Afjr.A7'^4nTiri 

Avrxixua .J ^ ui\ X 


712 


55 


60 


AGLA254 


■TlVJJjflo J " UXT X 


713 


AGLA254DK1 


714 


55 


60 




A/^T A9^^TTP1 
aVTJj/I^ J - J LI Xr Jl 


71 *^ 

/ x^ 


Af^T.A9RSnTJl 


71 

/ X 0 


55 


60 




Ar;T.A7*iRrrPi 


717 


ACLA2'ifiDNl 


718 


55 


60 




AfiT.A7finnP1 
/^oxjxI^ u u Liir X 


71 Q 

f x^ 


Af;T,A?fionKi 


720 


55 


60 




AoXin^ D ^ UXr X 


791 

/ 4l X 


Af^T.APfiQniJI 

AuXtxl^ U Zrl^lX X 


799 


55 


60 




ACT A9ft/ITTP1 


79T 


ACT A9flAnM1 
AVaXui^ o Hurt x 


79il 


55 


60 






i 4uO 


a LOO iNTiWt 


IOC 


55 


60 




TV /^T IS o Otto t 


id, t 


AuliiV^7x'Nx 


TOO 


55 


60 




AljliAZ y XUJfX 


/ 


ACT B9 QT T^W1 




55 


60 


ART A^O'^ 


ACT A90*^rTD1 


7^1 
/ JX 


ACT-AOQ'^rjMI 

^LUrJjxtX 70X/iV X 


7^9 
/ oz 


55 


60 


Titzi aft 


ACT AftTTDI 


/ 


ACT AflrkWI 




55 




vrDC on 






UI3I! OXli^lM X 


too 


55 








737 






60 








739 




740 


60 




MGTG1    MGTG1UP3    741  MGTG1DN1    742  55, 60
MGTG13B  MGTG13BUP3  743  MGTG13BDN2  744  55, 60
MGTG3    MGTG3UP1    745  MGTG3DN2    746  55, 60
MGTG4B   MGTG4BUP2   747  MGTG4BDN2   748  55, 60
MGTG7    MGTG7UP3    749  MGTG7DN3    750  55, 60
TGLA10   TGLA10UP1   751  TGLA10DN1   752  60
TGLA111  TGLA111UP1  753  TGLA111DN1  754  60
TGLA116  TGLA116UP1  755  TGLA116DN1  756  60
TGLA117  TGLA117UP1  757  TGLA117DN1  758  60
TGLA12   TGLA12UP1   759  TGLA12DN2   760  55
TGLA122  TGLA122UP1  761  TGLA122DN1  762  60
TGLA123  TGLA123UP1  763  TGLA123DN1  764  60
TGLA124  TGLA124UP1  765  TGLA124DN1  766  60
TGLA125  TGLA125UP2  767  TGLA125DN2  768  55, 60
TGLA126  TGLA126UP1  769  TGLA126DN1  770  55
TGLA127  TGLA127UP1  771  TGLA127DN1  772  55
TGLA128  TGLA128UP1  773  TGLA128DN1  774  60
TGLA130  TGLA130UP1  775  TGLA130DN1  776  55
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Z olJxvJ. 0 ^ U IT JU 


777 


,1 lit 1 tt\ L - 1 ^. 1 fVt 1 


778 










77Q 

1 f y 




1 ov 


DD 


DU 


TGLA137 


TGLA137UPX 


781 


TGIjAIjj /DNl 


TOO 


cr\ 
bO 




TGLA142  TGLA142UP1  783  TGLA142DN1  784  50
TGLA147  TGLA147UP1  785  TGLA147DN1  786  60
TGLA15   TGLA15UP2   787  TGLA15DN2   788  55, 60
TGLA153  TGLA153UP2  789  TGLA153DN2  790  60
TGLA158  TGLA158UP1  791  TGLA158DN1  792  60
TGLA159  TGLA159UP1  793  TGLA159DN1  794  55, 60
TGLA164  TGLA164UP1  795  TGLA164DN1  796  55, 60
TGLA170  TGLA170UP1  797  TGLA170DN1  798  60
TGLA176  TGLA176UP1  799  TGLA176DN1  800  60
TGLA182  TGLA182UP1  801  TGLA182DN1  802  60
TGLA203  TGLA203UP1  803  TGLA203DN1  804  60
TGLA206  TGLA206UP1  805  TGLA206DN1  806  60
TGLA210  TGLA210UP1  807  TGLA210DN1  808  60
TGLA214  TGLA214UP1  809  TGLA214DN1  810  55, 60
TGLA215  TGLA215UP1  811  TGLA215DN1  812  55, 60
TGLA22   TGLA22UP1   813  TGLA22DN1   814  60
TGLA227  TGLA227UP1  815  TGLA227DN1  816  55, 60
TGLA23   TGLA23UP1   817  TGLA23DN1   818  60
TGLA231  TGLA231UP1  819  TGLA231DN1  820  55, 60
TGLA245  TGLA245UP1  821  TGLA245DN1  822  55, 60
TGLA260  TGLA260UP1  823  TGLA260DN1  824  55, 60
TGLA263  TGLA263UP1  825  TGLA263DN1  826  55, 60
TGLA28   TGLA28UP3   827  TGLA28DN2   828  55, 60
TGLA303  TGLA303UP1  829  TGLA303DN1  830  60
TGLA304  TGLA304UP1  831  TGLA304DN1  832  60
TGLA307  TGLA307UP1  833  TGLA307DN1  834  55, 60
TGLA309  TGLA309UP1  835  TGLA309DN1  836  55, 60
TGLA322  TGLA322UP1  837  TGLA322DN1  838  55, 60
TGLA325  TGLA325UP1  839  TGLA325DN1  840  55, 60
TGLA327  TGLA327UP1  841  TGLA327DN1  842  60
TGLA328  TGLA328UP1  843  TGLA328DN1  844  55, 60
TGLA334  TGLA334UP1  845  TGLA334DN1  846  55, 60
TGLA337  TGLA337UP1  847  TGLA337DN1  848  60
TGLA339  TGLA339UP1  849  TGLA339DN1  850  55
TGLA34   TGLA34UP2   851  TGLA34DN1   852  60
TGLA340  TGLA340UP1  853  TGLA340DN1  854  55, 60
TGLA341  TGLA341UP1  855  TGLA341DN1  856  60
TGLA342  TGLA342UP1  857  TGLA342DN1  858  60
TGLA346  TGLA346UP1  859  TGLA346DN1  860  60
TGLA35   TGLA35UP1   861  TGLA35DN1   862  60
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TGLASSl 


TGLA351UP1 


863 


TGIiA35lDNl 




55 


50 


TGLA353 


TGLA353UP1 


865 


TGIiA353DNl 


bob 


cr\ 
oU 




TGLA354  TGLA354UP1  867  TGLA354DN1  868  55, 60
TGLA357  TGLA357UP1  869  TGLA357DN1  870  55, 60
TGLA36   TGLA36UP1   871  TGLA36DN1   872  60
TGLA37   TGLA37UP1   873  TGLA37DN1   874  60
TGLA377  TGLA377UP1  875  TGLA377DN1  876  55, 60
TGLA378  TGLA378UP1  877  TGLA378DN1  878  60
TGLA382  TGLA382UP1  879  TGLA382DN1  880  60
TGLA387  TGLA387UP1  881  TGLA387DN1  882  60
TGLA40   TGLA40UP1   883  TGLA40DN1   884  60
TGLA415  TGLA415UP1  885  TGLA415DN1  886  55, 60
TGLA420  TGLA420UP1  887  TGLA420DN1  888  55, 60
TGLA421  TGLA421UP1  889  TGLA421DN1  890  55, 60
TGLA423  TGLA423UP1  891  TGLA423DN1  892  55, 60
TGLA431  TGLA431UP1  893  TGLA431DN1  894  55, 60
TGLA433  TGLA433UP1  895  TGLA433DN1  896  60




TGIiA435 


TGLA435in?l 


897 


TGIiA4o ODNX 




oo 


bU 


TGLA44   TGLA44UP2   899  TGLA44DN1   900  55, 60
TGLA441  TGLA441UP1  901  TGLA441DN1  902  60
TGLA444  TGLA444UP1  903  TGLA444DN1  904  55, 60
TGLA45   TGLA45UP1   905  TGLA45DN1   906  60
TGLA47   TGLA47UP1   907  TGLA47DN1   908  55, 60
TGLA48   TGLA48UP1   909  TGLA48DN1   910  55, 60
TGLA49   TGLA49UP1   911  TGLA49DN2   912  55
TGLA51   TGLA51UP1   913  TGLA51DN1   914  60
TGLA52   TGLA52UP1   915  TGLA52DN1   916  55
TGLA53   TGLA53UP1   917  TGLA53DN1   918  55, 60
TGLA58   TGLA58UP1   919  TGLA58DN1   920  55, 60
TGLA6    TGLA6UP1    921  TGLA6DN1    922  60
TGLA60A  TGLA60AUP1  923  TGLA60ADN1  924  55
TGLA60B  TGLA60BUP1  925  TGLA60BDN1  926  55
TGLA61   TGLA61UP1   927  TGLA61DN1   928  55, 60
TGLA67   TGLA67UP1   929  TGLA67DN1   930  60
TGLA68   TGLA68UP1   931  TGLA68DN1   932  60
TGLA72   TGLA72UP1   933  TGLA72DN1   934  55, 60
TGLA73   TGLA73UP1   935  TGLA73DN1   936  55
TGLA75   TGLA75UP1   937  TGLA75DN1   938  55
TGLA76   TGLA76UP1   939  TGLA76DN1   940  55
TGLA77   TGLA77UP1   941  TGLA77DN1   942  55, 60
TGLA80   TGLA80UP1   943  TGLA80DN1   944  60
TGLA82   TGLA82UP1   945  TGLA82DN1   946  55
TGLA86   TGLA86UP1   947  TGLA86DN1   948  55, 60
TGLA89   TGLA89UP1   949  TGLA89DN1   950  52
(b) Multiplex Amplification

To increase the speed and lower the cost of geno-
typing, multiplex approaches for both amplification and
data capture of microsatellites are utilized. Micro-
satellite systems yielding products of non-overlapping
size were coamplified as described above. Preliminary
results show that at least four different systems can
easily be coamplified in these standard conditions. The
following multiplex amplifications, for instance, were
shown to yield consistent, easily interpretable results:

a. GBCYP21 - TGLA10 - TGLA44 - TGLA116

b. TGLA9 - MGTG4B - TGLA23 - TGLA35

c. MGTG3 - MGTG13B

By limiting detection to a single detection procedure
(autoradiography of 32P-labeled product), multiplex
amplification is limited to systems yielding products of
non-overlapping size. To overcome this limitation,
alternative detection schemes are utilized. In particu-
lar, confocal microscopy is used to detect products
labeled with laser-excitable fluorescent molecules (such
as fluorescein, rhodamine, ...). The products
can then be differentiated based on the specific excita-
tion and emission spectra of the tagged fluorescent
molecules. Using this approach, detection of at
least 20 different systems should be feasible.
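
The non-overlapping size requirement can be illustrated with a small grouping routine. The sketch below is only an illustration: the product size ranges and the system names SYS_A, SYS_B and SYS_C are hypothetical placeholders (no allele size ranges are listed here), and the greedy grouping shown is one simple way of forming compatible sets, not the procedure actually followed.

```python
def ranges_overlap(a, b):
    """True if two (min_bp, max_bp) product size ranges share any size."""
    return a[0] <= b[1] and b[0] <= a[1]


def group_non_overlapping(size_ranges):
    """Greedily place each system in the first group it does not overlap with."""
    groups = []
    for name, size_range in sorted(size_ranges.items(), key=lambda kv: kv[1]):
        for group in groups:
            if all(not ranges_overlap(size_range, size_ranges[other])
                   for other in group):
                group.append(name)
                break
        else:
            groups.append([name])
    return groups


if __name__ == "__main__":
    # Hypothetical size ranges in base pairs, for illustration only.
    hypothetical = {"SYS_A": (100, 120), "SYS_B": (130, 150), "SYS_C": (115, 135)}
    print(group_non_overlapping(hypothetical))
```

With several fluorescent labels, the same check would only need to be applied among systems carrying the same label.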

3. PCR-mapping of Bovine Microsatellites Using
Somatic Cell Hybrids

Results of the concordancy analysis are summarized
in Table 9. Synteny groups to which microsatellite
systems most likely map as deduced from concordancy
analysis are underlined. Clear-cut results were obtained
for MGTG13B (U19 or chromosome 15), TGLA6 (U11), TGLA9
(U27), TGLA11 (U16), TGLA22 (U26 or chromosome 26),
TGLA23 (U11), TGLA36 (U27), TGLA52 (U9 or chromosome
18). Results are less discriminating for the other
systems. Most likely synteny groups are, however,
mentioned. In addition, we know from the literature
that GBKCAS maps to U15 or chromosome 6, and GBCYP21 to
U20 or chromosome 23.
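
A concordancy analysis of this kind compares, hybrid by hybrid, whether the PCR product of a microsatellite is present with whether a given synteny group is retained. The sketch below shows the basic calculation under simple assumptions; the hybrid names and presence/absence calls used in the usage example are hypothetical and do not reproduce the data of Table 9.

```python
def concordance(marker_calls, group_calls):
    """Fraction of shared hybrids where marker and synteny group agree."""
    shared = marker_calls.keys() & group_calls.keys()
    if not shared:
        raise ValueError("no hybrid cell line scored for both")
    agree = sum(1 for h in shared if marker_calls[h] == group_calls[h])
    return agree / len(shared)


def rank_synteny_groups(marker_calls, panel):
    """Synteny groups sorted from most to least concordant with the marker."""
    scores = {g: concordance(marker_calls, calls) for g, calls in panel.items()}
    return sorted(scores.items(), key=lambda kv: kv[1], reverse=True)


if __name__ == "__main__":
    # Hypothetical presence (True) / absence (False) calls in four hybrids.
    marker = {"h1": True, "h2": False, "h3": True, "h4": False}
    panel = {
        "U_x": {"h1": True, "h2": False, "h3": True, "h4": False},
        "U_y": {"h1": True, "h2": True, "h3": False, "h4": False},
    }
    print(rank_synteny_groups(marker, panel))
```

A clear-cut assignment corresponds to one group with near-complete concordance and all others markedly lower; several groups with similar scores correspond to the less discriminating cases mentioned above.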
EXAMPLE 4

CONSTRUCTION OF A PRIMARY BOVINE DNA MARKER MAP

Bovine pedigrees for a total of approximately 200
individuals were genotyped for 150 of these markers as
described. Pair-wise linkage analysis was performed
using the LODSCORE program. Only lod score values
greater than 3 were considered significant. This
generated a primary DNA marker map with 24 linkage
groups counting two or more markers (15 assigned to
specific chromosomes or synteny groups), and 68
singleton markers. Table 10 summarizes our findings.
Linkage groups were assigned to specific chromosomes or
synteny groups whenever that information was available.
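
The meaning of the lod score threshold of 3 can be illustrated with the simplest textbook case. The sketch below is not the LODSCORE program, which handles general pedigrees; it assumes phase-known, fully informative meioses, for which the lod score at recombination fraction theta is log10 of theta^r (1 - theta)^(n - r) divided by 0.5^n, with r recombinants among n meioses. Function names are hypothetical.

```python
import math


def lod_score(recombinants, meioses, theta):
    """Two-point lod score for phase-known meioses at a given theta."""
    if not 0.0 <= theta <= 0.5:
        raise ValueError("theta must lie between 0 and 0.5")
    non_recombinants = meioses - recombinants
    log_lik = 0.0
    if recombinants:
        if theta == 0.0:
            return float("-inf")  # any recombinant excludes theta = 0
        log_lik += recombinants * math.log10(theta)
    if non_recombinants:
        log_lik += non_recombinants * math.log10(1.0 - theta)
    return log_lik - meioses * math.log10(0.5)


def max_lod(recombinants, meioses, step=0.01):
    """Grid search for the theta (0 to 0.5) giving the highest lod score."""
    thetas = [min(i * step, 0.5) for i in range(int(round(0.5 / step)) + 1)]
    best = max(thetas, key=lambda t: lod_score(recombinants, meioses, t))
    return best, lod_score(recombinants, meioses, best)


if __name__ == "__main__":
    print(max_lod(0, 10))
    print(max_lod(1, 20))
```

Under these assumptions, ten informative meioses without a recombinant are just enough to exceed a lod score of 3, which gives a feel for the amount of family material needed per marker pair.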

TABLE 10
Primary Bovine DNA Marker Map

CHR./SYNT.  LG  MARKER

Chr.2       1   GMBT28
            1   GMBT47
            1   GMBT61
            1   MSBT13
            1   TGLA11
            1   TGLA44
            1   TGLA61
            1   TGLA215
            1   TGLA159
            1   TGLA58
            1   TGLA60
            1   TGLA377
            1   MGTG4B
            2   TGLA116
            2   Weaver
Chr.6           GBKCAS

Chr.a GMBT17 
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Chr.lO 
5 Chr.l4 

Chr.l5 



10 

Chr.l9 
Chr.21 

15 



Chr.23 

20 



25 



30 



Chr.24 

35 
TABLE 10 (Continued)



LG 


MARKER 




GMBTIQ 










3 






noox J D 


A 
*k 


uor oil 


A 
*% 


nvT !h "7 c: 










tr 
i> 


GlinTiD 


K 

D 


GrlBTlo 
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GMBT39 
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c 






T/^ T 1 1 '5 •? 


O 




f. 
yj 




5 




5 


uxa.w L>xii 


5 




6 


w Uw X f ^ X 


6 




5 


MGTG7 


6 


T6IiA142 

c 


7 


GMBT41 


7 


MSBT6 


7 


AGLA29 


7 


TGLA126 


7 


TGLA153 


7 


TGLA214 


8 


GMBT5 


8 


MSBT33 
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CHR,/SYNT. 
Chr.26 

5 

Chr.X 

10 

Chr.y 
Ul 

15 

UIO 

20 

Ull 

25 U16 



U22 

30 

U23 



35 
TABLE 10 (Continued)



m 






GMBTll 




TGLA22 




TGIA51 




GMBT27 




TGLA68 




TGIA124 




TGLA72 




TGLA54 




TGLA89 




DYZl 


9 


GMBT42 


9 


MGTG13A 


9 


TGIiA53 


10 


GMBT7 


10 


GMBT26 


10 


MSBT122 


11 


TGLA52 


11 


TGLA57 


11 


TGLA415 


12 


TGLA6 


12 


TGLA23 




TGIA5 




GMBTl 


13 


GMBT53 


13 


TGLA206 




GMBT14 




GMBT8 




GMBT36 


14 


TGXA9 


14 


TGLA36 


15 


GMBT3 


15 


GMBT29 


15 


GMBT49 
TABLE 10 (Continued)





CHR./SYNT. 


LG 


MARKER 






16 


GMBT2 1 

V3flf X X. 


R 






w^loX*3 




? 


17 


VJlTl£} X «* *C 






1 7 


ITIOO J. -L<^ 






17 


TfiTAI fid 






17 


X wXJbnfiO 


1 

xu 




1 7 


X vvXin J U J 






1 R 


uinx>x xc 






1 R 
xo 






-p 


19 


AGIA226 






1 0 

X7 










nuxux 












■p 


21 


MSBTll 






21 


MSBT19 






21 


TGrA227 


20 


o 


22 


TGLA378 






22 


TGLA433 






23 


TGIA51 






23 


TGIA94 






24 


TGLA54 


25 




24 


TGIA68 



CHR./SYNT.: Chromosome or synteny group 
LG: Linkage group 

30 PQWgjLfygXQWS 

Samples of coli harboring clones of polymorphic 
bovid markers have been deposited on 17 January 1991 
with the American Type Culture Collection (Rockville^ 
Maryland) under accession numbers 68,514 and 68,515. 

35 Deposit of the clones is for the sake of completeness, 

but is not intended to limit the scope of the instant 
invention to said deposited materials. Access to said 
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cultures will be available during the pendency of the 
application to those determined by the Commissioner of 
Patents and Trademarks to be entitled thereto. All 
restrictions on availability will be removed upon grant 
of the application and said cultures will remain avail- 
able during the life of the patent. Nonviable or 
destroyed cultures will be replaced in kind. 

It will be appreciated that the methods and com- 
positions of the instant invention can be incorporated 
in the form of a variety of embodiments, only a few of 
which are disclosed herein. It will be apparent to the 
artisan that other embodiments exist and do not depart 
from the spirit of the invention. Thus the described 
embodiments are illustrative and should not be construed 
as restrictive.



WO 92/13102    PCT/US92/00340



TABLE 11
LIST OF REFERENCES 

1. NEI, M. (1987). Molecular Evolutionary Genetics,
Columbia University Press, New York.

2. KAZAZIAN, H.H., et al. (1983). Evolution of Genes
and Proteins (Ed. by M. Nei and R.K. Koehn), pp.
137-146. Sinauer, Sunderland, Mass.

3. GEORGES, M., et al. (1987). Animal Genetics 18:41-54.

4. HILBERT, P., et al. (1989). Animal Genetics 20:383-394.

5. BARKER, D., et al. (1984). Cell 36:131-138.

6. RUBIN, G.M. (1983). Mobile Genetic Elements (Ed.
by J.A. Shapiro), Academic Press, pp. 329-362.

7. HAWLEY, R.G., et al. (1982). Proc. Nat. Acad. Sci.
USA 79:7425-7429.

8. CANAANI, E., et al. (1983). Proc. Nat. Acad. Sci.
USA 80:7118-7122.

9. YMER, S., et al. (1985). Nature 317:255-258.

10. BLATT, C., et al. (1988). EMBO J. 7:4283-4290.

11. STOCKING, C., et al. (1988). Cell 53:869-879.

12. COHEN, J.C., and VARMUS, H.E. (1979). Nature 278:418-423.

13. LEHRMAN, M.A., et al. (1987). Cell 48:827-835.

14. KAZAZIAN, H.H., et al. (1988). Nature 332:164-166.

15. MORSE, B., et al. (1988). Nature 333:87-90.

16. LIN, C.S., et al. (1988). Cell 54:153-159.

17. WONG, Z., et al. (1990). Genomics 7:222-234.

18. JEFFREYS, A.J., et al. (1985). Nature 314:67-73.

19. JEFFREYS, A.J., and WILSON, V. (1985). Nature 316:76-79.

20. NAKAMURA, Y., et al. (1987). Science 235:1616-1622.






21. VASSART, G., et al. (1987). Science 235:683-684.

22. SINGER, M.F. (1982). Int. Review of Cytology 76:67-111.

23. LATHROP, M., et al. (1988). Genomics 2:157-164.

24. LATHROP, M., et al. (1988). Genomics 3:361-366.

25. NAKAMURA, Y., et al. (1988). Genomics 3:342-346.

26. NAKAMURA, Y., et al. (1988). Genomics 2:302-309.

27. NAKAMURA, Y., et al. (1988). Genomics 3:67-71.

28. O'CONNELL, P., et al. (1987). Genomics 1:93-102.

29. O'CONNELL, P., et al. (1989). Genomics 4:12-20.

30. JULIER, C., et al. (1990). Proc. Nat. Acad. Sci.
USA 87:4585-4589.

31. GEORGES, M., et al. (1990). Genomics 6:461-474.

32. WEBER, J.L. and MAY, P.E. et al. (1989). American
Journal of Human Genet. 44:388-96.

33. LITT, M. and LUTY, J.A. (1989). American Journal of
Human Genet. 44:397-401.

34. TAUTZ et al. (1989). Nucl. Acids Res. 17:6463-71.

35. ROYLE, N.J., et al. (1988). Genomics 3:352-360.

36. NAKAMURA, Y., et al. (1987). Nucl. Acids Res. 15:
2537-2547.

37. ECONOMOU, E.P., et al. (1990). Proc. Nat. Acad.
Sci. USA 87:2951-2954.

38. DOOLITTLE, W.F. (1985). Trends in Genetics 1(3):64-65.

39. JEFFREYS, A.J., et al. (1990). Cell 60:473-485.

40. SOUTHERN, E.M. (1975). Journal of Molecular Biology
98:503-517.

41. SAIKI, R.K., et al. (1988). Science 239:487.






42. KREITMAN, M., and AGUADE, M. (1986). Proc. Nat.
Acad. Sci. USA 83:3562-3566.

43. SKOLNICK, M.H., and WALLACE, R.B. (1988). Genomics
2:273-279.

44. GEORGES, M., et al. (1990). 4th World Congress on
Genetics applied to Livestock Production, Edinburgh,
Scotland, July 1990.

45. THEIN, S.L., and WALLACE, R.B. (1986). Human Genetic
Diseases: A Practical Approach (Ed. by K.E. Davies),
IRL Press.

46. SAIKI, R.K., et al. (1989). Proc. Nat. Acad. Sci.
USA 86:6230-6234.

47. NEWTON, C.R., et al. (1989). Nucl. Acids Research
17(7):2503-2516.

48. WU, D.Y., et al. (1989). Proc. Nat. Acad. Sci. USA
86:2757-2760.

49. GIBBS, R.A., et al. (1989). Nucleic Acids Research
17(7):2437-2448.

50. WU, D.Y., and WALLACE, R.B. (1989). Genomics 4:
560-569.

51. MYERS, R.M., et al. (1988). Genome Analysis (Ed.
by K.E. Davies). IRL Press.

52. ROBERTS, R.G., et al. (1989). Nucl. Acids Research
17(15):5961-5971.

53. ORITA, M., et al. (1989). Proc. Nat. Acad. Sci.
USA 86:2766-2770.

54. ORITA, M., et al. (1989). Genomics 5(4):874-879.

55. CHURCH, G.M. and GILBERT, W. (1984). Proc. Nat.
Acad. Sci. USA 81:1991-1995.

56. WONG, C., et al. (1987). Nature 330:384-386.

57. KIMURA, M. (1986). The Neutral Theory of Molecular
Evolution, Cambridge University Press.

58. NEI, M. (1983). Evolution of Genes and Proteins
(Ed. by M. Nei and R.K. Koehn). Sinauer Associates,
Sunderland.




59. HILL, R.E., and HASTIE, N.D. (1987). Nature 326:96-99.

60. LATHROP, G.M. and LALOUEL, J.M. (1984). Am. J. Human
Genetics 36:460-465.

61. FLINT, J., et al. (1986). Nature 321:744-750.

62. HUGHES, A.L., and NEI, M. (1988). Nature 335:167-170.

63. AYALA, F.J. (1972). Behav. Genet. 2:85-91.

64. LI, H., et al. (1988). Nature 335:414.

65. MORTON, N.E. (1955). American Journal of Human
Genetics 7:277-318.

66. LATHROP, G.M., et al. (1984). Proc. Nat. Acad. Sci.
USA 81:3443-3446.

67. LATHROP, G.M., et al. (1985). American Journal of
Human Genetics 37:482-498.

68. LANDER, E.S. and GREEN, P. (1987). Proc. Nat. Acad.
Sci. USA 84:2363-2367.

69. LICHTER, P., et al. (1990). Science 247:64-69.

70. COX, D.R., et al. (1990). Science 250:245-250.

71. WOMACK, J.E. (1987). Developmental Genetics 8:281-293.

72. WHITE, R., et al. (1985). Nature 313:101-105.

73. DONIS-KELLER, H., et al. (1987). Cell 51(3):319-337.

74. FRIES, R., et al. (1989). Animal Genet. 20:3-29.

75. LANDER, E.S., and BOTSTEIN, D. (1986). Cold Spring
Harbor Symposia on Quantitative Biology 51:49-62.

76. BOTSTEIN, D., et al. (1980). Am. J. Hum. Genet.
32:314-331.

77. WHITE, R., and CASKEY, C.T. (1988). Science 240:
1483-1488.

78. PATERSON, A.H., et al. (1989). Nature 335:721-726.

79. MARTIN, B., et al. (1989). Science 243:1725-1727.






80. HANSET, R. (1982). Proceedings of the 2nd World
Congress on Genetics applied to Livestock Production,
Madrid, Spain, 6:439-453.

81. LANDER, E.S., and BOTSTEIN, D. (1989). Genetics
121:185-199.

82. SOLLER, M., et al. (1976). Theoret. and Applied
Genetics 47:35-39.

83. WELLER, J.I., et al. (1990). In press.

84. ARNHEIM, N., et al. (1985). Proc. Nat. Acad. Sci.
USA 82:6970-6974.

85. PLOTSKY, Y., et al. (1990). Proceedings of the 4th
World Congress on Genetics Applied to Livestock
Production, Edinburgh, Scotland, July 1990. Volume
XIII, 133-136.

86. GELDERMANN, H., et al. (1985). Theoretical and
Applied Genetics 70:138-146.

87. COWAN, C.W., et al. (1990). Theoret. Appl. Genet.
79:577-582.

88. HANSET, R. (1989). Annales de Medecine Veterinaire
133:465-488.

89. GROSCLAUDE, F. (1988). INRA Production Animale
1(1):5-17.

90. GIBSON, J.P., et al. (1990). Proceedings of the 4th
World Congress on Genetics Applied to Livestock
Production, Edinburgh, Scotland, July 1990. Volume
XIV, 163-166.

91. SOLLER, M. and BECKMAN, J.S. (1982). Restriction
Fragment Length Polymorphisms and Genetic Improvement,
2nd World Congress on Genetics Applied to
Livestock Production, Madrid, 6:396-404.

92. HERR, C.M., et al. (1990). Proceedings of the 4th
World Congress on Genetics Applied to Livestock
Production, XVI:334-343.

93. MEDRANO, J.F., and AGUILAR-CORDOVA, E. (1990).
Biotechnology 8:144-146.

94. ARCHIBALD, A.L. and IMLAH, P. (1985). Animal Blood
Groups and Biochemical Genetics 16:253-263.






95. MacLENNAN, D.H. (1990). Nature 343:559-561.

96. SMITH, C., and SIMPSON, S.P. (1986). J. Animal
Breeding Genet. 103:205-217.

97. WOMACK, J.E., and MOLL, Y.D. (1986). J. Hered. 77:
2-7.

98. WHITE, R. and LALOUEL, J.M. (1988). Ann. Rev.
Genet. 22:259-79.

99. COOPER, D.N., et al. (1985). Human Genet. 69:201-205.

100. RUSSE, I. (1983). Bibliotheca Anatomica 24:77-92.

101. BETTERIDGE, K.J., et al. (1989). Journal of
Reproduction and Fertility Suppl. 38:87-98.

102. EPPIG, J.J. (1989). J. Reprod. Fert. Suppl. 38:3-8.

103. GOULD, K.L., et al. (1990). Science 250:1573-1575.

104. HUNT, T. (1989). Nature 342:483-484.

105. SEIDEL, G.E. (1981). Science 211:351-358.

106. EPPIG, J.J. and SCHROEDER, A.C. (1989). Biol.
Reprod. 41:268-276.

107. CARROLL, J., et al. (1990). Journal of Reproduction
and Fertility 90:321-327.

108. MORTON, D.B., et al. (1987). Veterinary Record
121:592-593.

109. GEORGES, M., et al. (1988). J. Am. Vet. Med. Assoc.
193:9.

110. GEORGES, M., et al. (1988). Cytogenet. Cell Genet.
47:127-131.

111. FEINBERG and VOGELSTEIN (1983). Anal. Biochem.
132:6-13.

112. SEALEY, P.G., et al. (1985). Nucl. Acids Res. 13:
1805-1922.

113. ODELBERG, S.J., et al. (1989). Genomics 5:914-924.

114. THREADGILL, D.W., et al. (1990). Genomics 8:22-28.






115. FRIES, R., et al. (1986). Animal Genetics 17:287-294.

116. ISCNDA (1989). International System for Cytogenetic
Nomenclature of Domestic Animals, D. Di Berardino et
al., Eds., Cytogenet. Cell Genet. 53:65-79 (1990).

117. PERRET, J., et al. (1990). Genomics 6:482-490.

118. EWENS, W.J., et al. (1981). Proc. Nat. Acad. Sci.
USA 78:3748-3750.

119. CHAKRAVARTI, A., et al. (1984). Proc. Nat. Acad.
Sci. USA 81:6085-6089.

120. SHIN, et al. (19 )

121. JARMAN, et al. (19 )

122. BELL, et al. (19 )

123. COPPIETERS, W., et al. (19 ). Animal Genetics 21:
29-30.

124. JEFFREYS, A.J., et al. (1988). Nature 332:278-280.

125. JEFFREYS, A.J., et al. (1988). Nature 332:278-280.

126. HILLEL et al. (1989). Anim. Genetics 20:145-155.

127. FARBER, C.M., et al. (1989). Human Genet. 82:1-3.

128. BISHOP, D.T., et al. (1983). Am. J. Human Genet.
33:795-815.

129. SKOLNICK, M.H. and WHITE, R. (1982). Cytogenet.
Cell Genet. 32:58-67.

130. EWENS, W.J., et al. (1981). Proc. Nat. Acad. Sci.
USA 78:3748-3750.

131. SCHUMM, J.W., et al. (1988). Am. J. Hum. Genetics 42:
143-159.

132. COOPER, D.N., et al. (1985). Human Genetics 69:
201-205.

133. WOMACK, J.E. (1987). Development. Genet. 8:281-293.

134. RIORDAN, J.R., et al. (1989). Science 245:1066-1073.

135. SHOWS, T.B., et al. (1983). Advances in Human
Genetics 12:341-452.






136. FALCONER, D.S. (1989). Introduction to Quantitative
Genetics. 3rd Ed., Longman Scientific & Technical.

137. HULTMAN, T., et al. (1989). Nucl. Acids Research
17(11):4937-4946.

138. ORKIN, S.H. (1986). Cell 47:845.

139. QUESADA, M.A., et al. (1991). Biotechniques (in
press).

140. HOESCHELE, I. and MEINERT, T.R. (1990). J. Dairy
Sci. 73:2503.

141. GEORGES, M. and MASSEY, J.M. (1991). Theriogenology
35:151.

142. GEORGES, M. (1991). In Animal Applications of
Research in Mammalian Development, Pedersen et al.,
Eds., Cold Spring Harbor Laboratory Press, Cold
Spring Harbor, New York, pp. 285-325.

143. GEORGES, M., et al. (1988). Nucl. Acids Res. 15:7193.

144. FRIES, R., et al. (1990). In Gene Mapping:
Strategies, Techniques and Applications, Schook,
L.B. et al., Eds., Dekker, New York.

145. JULIER, C., et al. (1990). Proc. Nat. Acad. Sci. USA
87:4585-4589.

146. WONG, Z., et al. (1986). Nucl. Acids Res. 14:4605-
4616.

147. WONG, Z., et al. (1987). Ann. Hum. Genet. 51:269-
288.

148. STEELE, M.R. and GEORGES, M. (1991). Genomics
10:889-904.

149. FRIES, R., et al. (1990). Genomics 8:403-406.

150. GEORGES, M., et al. (1991). Genomics (in press).

151. LOGUE, D.N. and HARVEY, M.J.A. (1978). J. Reprod.
Fertil. 54:159-165.

152. BOWDEN, D.W., et al. (1989). J. Human Genet.
44:671-678.






SEQUENCE LISTING

(1) GENERAL INFORMATION:

(i) APPLICANT: Georges, Michel
Massey, Joseph M

(ii) TITLE OF INVENTION: POLYMORPHIC DNA MARKERS IN
BOVIDAE

(iii) NUMBER OF SEQUENCES: 952

(iv) CORRESPONDENCE ADDRESS:

(A) ADDRESSEE: Venable, Baetjer, Howard &
Civiletti

(B) STREET: 1201 New York Avenue N.W., Suite
1000

(C) CITY: Washington

(D) STATE: DC

(E) COUNTRY: USA

(F) ZIP: 22205

(v) COMPUTER READABLE FORM:

(A) MEDIUM TYPE: Floppy disk

(B) COMPUTER: IBM PC compatible

(C) OPERATING SYSTEM: PC-DOS/MS-DOS

(D) SOFTWARE: WordPerfect 5.1

(vi) CURRENT APPLICATION DATA:

(A) APPLICATION NUMBER: WO

(B) FILING DATE: 15-JAN-1992

(C) CLASSIFICATION:

(vii) PRIOR APPLICATION DATA:

(A) APPLICATION NUMBER: US 642,342

(B) FILING DATE: 15-JAN-1991

(C) CLASSIFICATION:

(viii) ATTORNEY/AGENT INFORMATION:

(A) NAME: Ihnen, Jeffrey L

(B) REGISTRATION NUMBER: 28,957

(C) REFERENCE/DOCKET NUMBER: 18681-91367

(ix) TELECOMMUNICATION INFORMATION:

(A) TELEPHONE: 202-962-4810

(B) TELEFAX: 202-962-8300
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TGTGTGTGTGTGTGTGTGTGTGTGT 

(2) INFORMATION FOR SEQ ID NO: 468: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 149 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bos taurus 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: TGLA354 (down) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 468: 

TGCATAAGAGAAACCACTACAAGGAGAAGCCCACACATGGCAACTAGAGT 
ACCCGTGCTTGCCAAATATAGAGATAGCCCATGTGCAGCGATGAAGACCC 
AGCACAGCCAAAAATAAACATTTATTTATTTATTTTTGGCTGTATGTAT 

(2) INFORMATION FOR SEQ ID NO: 469:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bos taurus 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: TGLA357 (up)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 469: 
AGCTCAGAGGGCAAAAAGGTTTGGGGTGTATGGG
(2) INFORMATION FOR SEQ ID NO: 470: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
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(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bos taurus 

(vii) IMMEDIATE SOURCE:

(B) CLONE: TGLA357 (repeat) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 470: 
TGTGTGTGTGTGTGTGTGTGTGTGTGTGTG 
(2) INFORMATION FOR SEQ ID NO: 471: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 71 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bos taurus 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: TGLA357 (down) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 471:

CCATGGTTTGTGCAGAGTCTGAGTTTAAACTTCTCTAACACCTTTGATCT 
CTCCCTCTGTCTCATCTAAGC 

(2) INFORMATION FOR SEQ ID NO: 472: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 43 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bos taurus 

(vii) IMMEDIATE SOURCE:

(B) CLONE: TGLA36 (up)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 472:
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\ m T ig CI AI WED IS ; 

1. A set of nucleic acid fragments that hybridize to
polymorphic loci in bovids, wherein said set comprises
fragments that hybridize to at least five
unique loci and each fragment hybridizes to a locus
comprising at least two alleles and with a
heterozygosity of at least 50%.

2. The set of claim 1, wherein said polymorphic loci
are selected from the group consisting of VNTR
loci, multisite haplotype loci, microsatellite loci
and combinations thereof.

3. The set of claim 2, wherein said polymorphic loci
are VNTR loci.

4. The set of claim 3, wherein said fragments are
selected from the group of VNTR markers identified
in Table 1.

5. The set of claim 2, wherein said polymorphic loci
are multisite haplotype loci.

6. The set of claim 5, wherein said fragments are
selected from the group of multisite haplotype
markers identified in Table 5.

7. The set of claim 2, wherein said polymorphic loci
are microsatellite loci.

8. The set of claim 7, wherein said fragments are
selected from the group of microsatellite markers
identified in Table 7.

9. The set of claim 1, wherein said bovids are
bovines.
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10. The set of claim 9, wherein said bovines are of the 
species Bos taurus . 

11. The set of claim 1, wherein said bovids are ovines.

12. The set of claim 1, wherein said fragments are
obtained from a bovid genome.

13. The set of claim 12, wherein said bovid is a
bovine.

14. The set of claim 13, wherein said bovine is of the
species Bos taurus.

15. The set of claim 2, wherein said fragments are
selected from the group of VNTR markers identified
in Table 1, the group of multisite haplotype
markers identified in Table 5, the group of
microsatellite markers identified in Table 7, and
combinations thereof.

16. The set of claim 7, wherein said fragments are
selected from the group of microsatellite markers
identified in Table 8.

17. A synteny map of microsatellite markers identified
in Table 9.

18. A synteny map of VNTR markers identified in Table
4.

19. The microsatellite marker TGLA116 for the Weaver
condition.

20. The microsatellite marker TGLA116 for the QTL trait
of enhanced milk production in Brown Swiss cattle.
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21. A set of nucleic acid fragments comprising at least 
one fragment selected from the group consisting of
the VNTR markers identified in Table 1, the multisite
haplotype markers identified in Table 5, and
the microsatellite markers identified in Table 7.

22. A process for mapping quantitative traits in bovids
which comprises using the set of claim 21.

23. A process for genetic identification using the set
of claim 21.

24. A process for introducing a desired gene into a
bovid which comprises using the set of claim 21.

25. The process of claim 21, which further comprises
the use of velogenesis.
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FIG. I 
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P 1 2 3 4 5 



Repeat: G(TG)13 TA (TG)^ T
PCR-Product: 168 bp

FIG. 2 
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